Alexander Altland and Ben Simons



Condensed Matter 
Field Theory 

SECOND EDITION 














Condensed Matter Field Theory 

Second edition 



Modern experimental developments in condensed matter and ultracold atom physics present 
formidable challenges to theorists. This book provides a pedagogical introduction to quantum
field theory in many-particle physics, emphasizing the applicability of the formalism
to concrete problems. 

This second edition contains two new chapters developing path integral approaches to 
classical and quantum nonequilibrium phenomena. Other chapters cover a range of topics, 
from the introduction of many-body techniques and functional integration, to renormaliza- 
tion group methods, the theory of response functions, and topology. Conceptual aspects and 
formal methodology are emphasized, but the discussion focuses on practical experimental 
applications drawn largely from condensed matter physics and neighboring fields. 

Extended and challenging problems with fully-worked solutions provide a bridge between 
formal manipulations and research-oriented thinking. Aimed at elevating graduate students 
to a level where they can engage in independent research, this book complements
graduate-level courses on many-particle theory.

Alexander Altland is Professor of Theoretical Condensed Matter Physics at the Institute 
of Theoretical Physics, University of Köln. His main areas of research include mesoscopic
physics, the physics of interacting many-particle systems, and quantum nonlinear dynamics.



Benjamin D. Simons is Professor of Theoretical Condensed Matter Physics at the 
Cavendish Laboratory, University of Cambridge. His main areas of research include strongly 
correlated condensed matter systems, mesoscopic and ultracold atom physics. 








CAMBRIDGE UNIVERSITY PRESS 

Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, 

São Paulo, Delhi, Dubai, Tokyo

Cambridge University Press 

The Edinburgh Building, Cambridge CB2 8RU, UK 

Published in the United States of America by Cambridge University Press, New York 

www.cambridge.org 

Information on this title: www.cambridge.org/9780521769754
© A. Altland and B. Simons 2010 



This publication is in copyright. Subject to statutory exception and to the 
provision of relevant collective licensing agreements, no reproduction of any part 
may take place without the written permission of Cambridge University Press. 

First published in print format 2010 

ISBN-13 978-0-511-78928-1 eBook (NetLibrary) 

ISBN-13 978-0-521-76975-4 Hardback 



Cambridge University Press has no responsibility for the persistence or accuracy 
of urls for external or third-party internet websites referred to in this publication, 
and does not guarantee that any content on such websites is, or will remain, 
accurate or appropriate. 



Contents 



Preface

1 From particles to fields
1.1 Classical harmonic chain: phonons
1.2 Functional analysis and variational principles
1.3 Maxwell’s equations as a variational principle
1.4 Quantum chain
1.5 Quantum electrodynamics
1.6 Noether’s theorem
1.7 Summary and outlook
1.8 Problems

2 Second quantization
2.1 Introduction to second quantization
2.2 Applications of second quantization
2.3 Summary and outlook
2.4 Problems

3 Feynman path integral
3.1 The path integral: general formalism
3.2 Construction of the path integral
3.3 Applications of the Feynman path integral
3.4 Summary and outlook
3.5 Problems

4 Functional field integral
4.1 Construction of the many-body path integral
4.2 Field integral for the quantum partition function
4.3 Field theoretical bosonization: a case study
4.4 Summary and outlook
4.5 Problems

5 Perturbation theory
5.1 General structures and low-order expansions
5.2 Ground state energy of the interacting electron gas
5.3 Infinite-order expansions
5.4 Summary and outlook
5.5 Problems

6 Broken symmetry and collective phenomena
6.1 Mean-field theory
6.2 Plasma theory of the interacting electron gas
6.3 Bose-Einstein condensation and superfluidity
6.4 Superconductivity
6.5 Field theory of the disordered electron gas
6.6 Summary and outlook
6.7 Problems

7 Response functions
7.1 Crash course in modern experimental techniques
7.2 Linear response theory
7.3 Analytic structure of correlation functions
7.4 Electromagnetic linear response
7.5 Summary and outlook
7.6 Problems

8 The renormalization group
8.1 The one-dimensional Ising model
8.2 Dissipative quantum tunneling
8.3 Renormalization group: general theory
8.4 RG analysis of the ferromagnetic transition
8.5 RG analysis of the nonlinear $\sigma$-model
8.6 Berezinskii-Kosterlitz-Thouless transition
8.7 Summary and outlook
8.8 Problems

9 Topology
9.1 Example: particle on a ring
9.2 Homotopy
9.3 $\theta$-terms
9.4 Wess-Zumino terms
9.5 Chern-Simons terms
9.6 Summary and outlook
9.7 Problems

10 Nonequilibrium (classical)
10.1 Fundamental questions of (nonequilibrium) statistical mechanics
10.2 Langevin theory
10.3 Boltzmann kinetic theory
10.4 Stochastic processes
10.5 Field theory I: zero dimensional theories
10.6 Field theory II: higher dimensions
10.7 Field theory III: applications
10.8 Summary and outlook
10.9 Problems

11 Nonequilibrium (quantum)
11.1 Prelude: Quantum master equation
11.2 Keldysh formalism: basics
11.3 Particle coupled to an environment
11.4 Fermion Keldysh theory (a list of changes)
11.5 Kinetic equation
11.6 A mesoscopic application
11.7 Full counting statistics
11.8 Summary and outlook
11.9 Problems

Index



Preface 



In the past few decades, the field of quantum condensed matter physics has seen rapid 
and, at times, almost revolutionary development. Undoubtedly, the success of the field 
owes much to ground-breaking advances in experiment: already the controlled fabrication 
of phase coherent electron devices on the nanoscale is commonplace (if not yet routine), 
while the realization of ultra-cold atomic gases presents a new arena in which to explore 
strong interaction and condensation phenomena in Fermi and Bose systems. These, along 
with many other examples, have opened entirely new perspectives on the quantum physics 
of many-particle systems. Yet, important as it is, experimental progress alone does not, 
perhaps, fully explain the appeal of modern condensed matter physics. Indeed, in concert 
with these experimental developments, there has been a “quiet revolution” in condensed 
matter theory, which has seen phenomena in seemingly quite different systems united by 
common physical mechanisms. This relentless “unification” of condensed matter theory, 
which has drawn increasingly on the language of low-energy quantum field theory, betrays 
the astonishing degree of universality, not fully appreciated in the early literature.

On a truly microscopic level, all forms of quantum matter can be formulated as a many-body
Hamiltonian encoding the fundamental interactions of the constituent particles. However,
in contrast with many other areas of physics, in practically all cases of interest in
condensed matter the structure of this operator conveys as much information about the 
properties of the system as, say, the knowledge of the basic chemical constituents tells us 
about the behavior of a living organism! Rather, in the condensed matter environment, 
it has been a long-standing tenet that the degrees of freedom relevant to the low-energy 
properties of a system are very often not the microscopic ones. Although, in earlier times, the
passage between the microscopic degrees of freedom and the relevant low-energy degrees of 
freedom has remained more or less transparent, in recent years this situation has changed 
profoundly. It is a hallmark of many “deep” problems of modern condensed matter physics 
that the connection between the two levels involves complex and, at times, even controversial 
mappings. To understand why, it is helpful to place these ideas on a firmer footing. 

Historically, the development of modern condensed matter physics has, to a large extent,
hinged on the “unreasonable” success and “notorious” failures of non-interacting theories.
The apparent impotency of interactions observed in a wide range of physical systems can be
attributed to a deep and far-reaching principle of adiabatic continuity: the quantum numbers
that characterize a many-body system are determined by fundamental symmetries (translation,
rotation, particle exchange, etc.). Provided that the integrity of the symmetries is
maintained, the elementary “quasi-particle” excitations of an interacting system can usually
be traced back “adiabatically” to the bare particle excitations present in the
non-interacting system. Formally, one can say that the radius of convergence of perturbation
theory extends beyond the region in which the perturbation is small. For example, this
quasi-particle correspondence, embodied in Landau’s Fermi-liquid theory, has provided a
reliable platform for the investigation of a wide range of Fermi systems, from conventional
metals to helium-3 fluids and cold atomic Fermi gases.

However, being contingent on symmetry, the principle of adiabatic continuity and, with 
it, the quasi-particle correspondence, must be abandoned at a phase transition. Here, inter- 
actions typically effect a substantial rearrangement of the many-body ground state. In the 
symmetry-broken phase, a system may - and frequently does - exhibit elementary exci- 
tations very different from those of the parent non-interacting phase. These elementary 
excitations may be classified as new species of quasi-particle with their own characteristic 
quantum numbers, or they may represent a new kind of excitation - a collective mode - 
engaging the cooperative motion of many bare particles. Many familiar examples fall into 
this category: when ions or electrons condense from a liquid into a solid phase, translational 
symmetry is broken and the elementary excitations - phonons - involve the motion of many 
individual bare particles. Less mundane, at certain field strengths, the effective low-energy
degrees of freedom of a two-dimensional electron gas subject to a magnetic field (the quantum
Hall system) appear as quasi-particles carrying a rational fraction (!) of the elementary
electron charge - an effect manifestly non-perturbative in character. 

This reorganization lends itself to a hierarchical perspective of condensed matter already 
familiar in the realm of particle physics. Each phase of matter is associated with a unique 
“non-interacting” reference state with its own characteristic quasi-particle excitations - a 
product only of the fundamental symmetries that classify the phase. While one stays within 
a given phase, one may draw on the principle of continuity to infer the influence of inter- 
actions. Yet this hierarchical picture delivers two profound implications. Firstly, within the 
quasi-particle framework, the underlying “bare” or elementary particles remain invisible 
(witness the fractionally charged quasi-particle excitations of the fractional quantum Hall 
fluid!). (To quote from P. W. Anderson’s now famous article “More is different,” (Science
177 (1972), 393-6), “the ability to reduce everything to simple fundamental laws does not 
imply the ability to start from those laws and reconstruct the universe.”) Secondly, while 
the capacity to conceive of new types of interaction is almost unbounded (arguably the 
most attractive feature of the condensed matter environment!), the freedom to identify 
non-interacting or free theories is strongly limited, constrained by the space of fundamen- 
tal symmetries. When this is combined with the principle of continuity, the origin of the 
observed “universality” in condensed matter is revealed. Although the principles of adiabatic
continuity, universality, and the importance of symmetries were anticipated and
emphasized long ago by visionary theorists, it is perhaps not until relatively recently that
their mainstream consequences have become visible.







How can these concepts be embedded into a theoretical framework? At first sight, the 
many-body problem seems overwhelmingly daunting. In a typical system, there exist some 
$10^{23}$ particles interacting strongly with their neighbors. Monitoring the collective dynamics,
even in a classical system, is evidently a hopeless enterprise. Yet, from our discussion above, 
it is clear that, by focussing on the coordinates of the collective degrees of freedom, one 
may develop a manageable theory involving only a restricted set of excitations. The success 
of quantum field theory in describing low-energy theories of particle physics as a successive 
hierarchy of broken symmetries makes its application in the present context quite natural. 
As well as presenting a convenient and efficient microscopic formulation of the many-body 
problem, the quantum field theory description provides a vehicle to systematically identify, 
isolate, and develop a low-energy theory of the collective field. Moreover, when cast as a field 
integral, the quantum field theory affords a classification of interacting systems into a small 
number of universality classes defined by their fundamental symmetries (a phenomenon not 
confined by the boundaries of condensed matter - many concepts originally developed in 
medium- or high-energy physics afford a seamless application in condensed matter). This 
phenomenon has triggered a massive trend of unification in modern theoretical physics. 
Indeed, by now, several sub-fields of theoretical physics have emerged (such as conformal 
field theory, random matrix theory, etc.) that define themselves not so much through any 
specific application as by a certain conceptual or methodological framework. 

In deference to the importance attached to the subject, in recent years a number of 
texts have been written on the subject of quantum field theory within condensed matter. 
It is, therefore, pertinent for a reader to question the motivation for the present text. 
Firstly, the principal role of this text is as a primer aimed at elevating graduate students 
to a level where they can engage in independent research. Secondly, while the discussion 
of conceptual aspects takes priority over the exposure to the gamut of condensed matter 
applications, we have endeavored to keep the text firmly rooted in practical experimental 
application. Thirdly, as well as routine exercises, the present text includes extended problems 
which are designed to provide a bridge from formal manipulations to research-oriented 
thinking. Indeed, in this context, readers may note that some of the “answered” problems 
are deliberately designed to challenge: it is, after all, important to develop a certain degree 
of intuitive understanding of formal structures and, sadly, this can be acquired only by 
persistent and, at times, even frustrating training! 

With this background, let us now discuss in more detail the organization of the text. 
To prepare for the discussion of field theory and functional integral techniques we begin in 
Chapter 1 by introducing the notion of a classical and a quantum field. Here we focus on 
the problem of lattice vibrations in the discrete harmonic chain, and its “ancestor” in the 
problem of classical and quantum electrodynamics. The development of field integral meth- 
ods for the many-body system relies on the formulation of quantum mechanical theories in 
the framework of the second quantization. In Chapter 2 we present a formal and detailed 
introduction to the general methodology. To assimilate this technique, and motivate some 
of the examples discussed later in the text, a number of separate and substantial appli- 
cations are explored in this chapter. In the first of these, we present (in second-quantized 
form) a somewhat cursory survey of the classification of metals and insulators, identifying a
canonical set of model Hamiltonians, some of which form source material for later chapters.
In the case of the one-dimensional system, we will show how the spectrum of elementary 
collective excitations can be inferred using purely operator methods within the framework 
of the bosonization scheme. Finally, to close the chapter, we will discuss the application of 
the second quantization to the low-energy dynamics of quantum mechanical spin systems. 
As a final basic ingredient in the development of the quantum field theory, in Chapter 3 we 
introduce the Feynman path integral for the single-particle system. As well as represent- 
ing a prototype for higher-dimensional field theories, the path integral method provides a 
valuable and recurring computational tool. This being so, we have included in this chapter 
a pedagogical discussion of a number of rich and instructive applications which range from 
the canonical example of a particle confined to a single or double quantum well, to the 
tunneling of extended objects (quantum fields), quantum dissipation, and the path integral 
formulation of spin. 

Having accumulated all of the necessary background, in Chapter 4 we turn to the formula- 
tion and development of the field integral of the quantum many-particle system. Beginning 
with a discussion of coherent states for Fermi and Bose systems, we develop the many- 
body path integral from first principles. Although the emphasis in the present text is on 
the field integral formulation, the majority of early and seminal works in the many-body 
literature were developed in the framework of diagrammatic perturbation theory. To make 
contact with this important class of approximation schemes, in Chapter 5 we explore the 
way diagrammatic perturbation series expansions can be developed systematically from the 
field integral. Employing the $\phi^4$-theory as a canonical example, we describe how to explore
the properties of a system in a high order of perturbation theory around a known refer- 
ence state. To cement these ideas, we apply these techniques to the problem of the weakly 
interacting electron gas. 

Although the field integral formulation provides a convenient means to organize pertur- 
bative approximation schemes as a diagrammatic series expansion, its real power lies in 
its ability to identify non-trivial reference ground states, or “mean-fields,” and to provide 
a framework in which low-energy theories of collective excitations can be developed. In 
Chapter 6, a fusion of perturbative and mean-field methods is used to develop analyti- 
cal machinery powerful enough to address a spectrum of rich applications ranging from 
metallic magnetism and superconductivity to superfluidity. To bridge the gap between the 
(often abstract) formalism of the field integral, and the arena of practical application, it is 
necessary to infer the behavior of correlation functions. Beginning with a brief survey of con- 
cepts and techniques of experimental condensed matter physics, in Chapter 7 we highlight 
the importance of correlation functions and explore their connection with the theoretical 
formalism developed in previous chapters. In particular, we discuss how the response of 
many-body systems to various types of electromagnetic perturbation can be described in 
terms of correlation functions and how these functions can be computed by field theoretical 
means. 

Although the field integral is usually simple to formulate, its properties are not always 
easy to uncover. Amongst the armory of tools available to the theorist, perhaps the 
most adaptable and versatile is the method of the renormalization group. Motivating
our discussion with two introductory examples drawn from a classical and a quantum
theory, in Chapter 8 we become acquainted with the renormalization group method as a 
concept whereby nonlinear theories can be analyzed beyond the level of plain perturbation 
theory. With this background, we then proceed to discuss renormalization methods in more 
rigorous and general terms, introducing the notion of scaling, dimensional analysis, and the 
connection to the general theory of phase transitions and critical phenomena. To conclude 
this chapter, we visit a number of concrete implementations of the renormalization group 
scheme introduced and exemplified on a number of canonical applications. 

In Chapter 9, we turn our attention to low-energy theories with non-trivial forms of 
long-range order. Specifically, we will learn how to detect and classify topologically non- 
trivial structures, and to understand their physical consequences. Specifically, we explore the 
impact of topological terms (i.e. $\theta$-terms, Wess-Zumino terms, and Chern-Simons terms)
on the behavior of low-energy field theories solely through the topology of the underlying
field configurations. Applications discussed in this chapter include persistent currents, ’t
Hooft’s $\theta$-vacua, quantum spin chains, and the quantum Hall effects.

So far, our development of field theoretic methodologies has been tailored to the consid- 
eration of single-particle quantum systems, or many-body systems in thermal equilibrium. 
However, studies of classical nonequilibrium systems have a long and illustrious history, 
dating back to the earliest studies of thermodynamics, and these days include a range of 
applications from soft matter physics to population dynamics and ecology. At the same time, 
the control afforded by modern mesoscopic semiconducting and metallic devices, quantum 
optics, as well as ultracold atom physics now allows controlled access to quantum systems
driven far from equilibrium. For such systems, traditional quantum field theoretical method- 
ologies are inappropriate. 

Starting with the foundations of non-equilibrium statistical mechanics, from simple one- 
step processes, to reaction-diffusion type systems, in Chapter 10 we begin by developing 
Langevin and Fokker-Planck theory, from which we establish classical Boltzmann transport
equations. We then show how these techniques can be formulated in the language of
the functional integral developing the Doi-Peliti and Martin-Siggia-Rose techniques. We 
conclude our discussion with applications to nonequilibrium phase transitions and driven 
lattice gases. These studies of the classical nonequilibrium system provide a platform to 
explore the quantum system. In Chapter 11, we develop the Keldysh approach to quantum 
non-equilibrium systems based, again, on the functional integral technique. In particular, 
we emphasize and exploit the close connections to classical nonequilibrium field theory, and 
present applications to problems from the arena of quantum transport. 

To focus and limit our discussion, we have endeavored to distill material considered 
“essential” from the “merely interesting” or “background.” To formally acknowledge and 
identify this classification, we have frequently included reference to material which we believe 
may be of interest to the reader in placing the discussion in context, but which can be 
skipped without losing the essential thread of the text. These intermissions are signaled in 
the text as “Info” blocks. 

At the end of each chapter, we have collected a number of pedagogical and instructive 
problems. In some cases, the problems expand on some aspect of the main text requiring only
an extension, or straightforward generalization, of a concept raised in the chapter. In other
cases, the problems rather complement the main text, visiting fresh applications of the same 
qualitative material. Such problems take the form of case studies in which both the theory 
and the setting chart new territory. The latter provide a vehicle to introduce some core areas 
of physics not encountered in the main text, and allow the reader to assess the degree to 
which the ideas in the chapter have been assimilated. For both types of problem, to make
them more inclusive and useful as a reference, we have included (sometimes abridged,
and sometimes lengthy) answers. In this context, Section 6.5 assumes a somewhat special
role: the problem of phase coherent electron transport in weakly disordered media provides 
a number of profoundly important problems of great theoretical and practical significance. 
In preparing this section, it became apparent that the quantum disorder problem presents 
an ideal environment in which many of the theoretical concepts introduced in the previous 
chapters can be practiced and applied - to wit diagrammatic perturbation theory and series 
expansions, mean-field theory and collective mode expansions, correlation functions and 
linear response, and topology. We have therefore organized this material in the form of an 
extended problem set in Chapter 6. 

This concludes our introduction to the text. Throughout, we have tried to limit the range 
of physical applications to examples which are rooted in experimental fact. We have resisted 
the temptation to venture into more speculative areas of theoretical condensed matter at 
the expense of excluding many modern and more-circumspect ideas which pervade the con- 
densed matter literature. Moreover, since the applications are intended to help motivate and 
support the field theoretical techniques, their discussion is, at times, necessarily superficial. 
(For example, the hundreds of pages of text in this volume could have been invested in their
entirety in the subject of superconductivity!) Therefore, where appropriate, we have tried 
to direct interested readers to the more specialist literature. 

In closing, we would like to express our gratitude to Jakob Müller-Hill, Tobias Micklitz,
Jan Müller, Natalja Strelkova, Franjo-Frankopan Velic, Andrea Wolff, and Markus Zowislok
for their invaluable assistance in the proofreading of the text. Moreover, we would also like 
to thank Julia Meyer for her help in drafting problems. Finally we would like to acknowledge 
Sasha Abanov for his advice and guidance in the drafting of the chapter on Topology. 

As well as including additional material on the formulation of functional field integral 
methods to classical and quantum nonequilibrium physics in Chapters 10 and 11, in prepar- 
ing the second edition of the text, we have endeavored to remove some of the typographical 
errors that crept into the first edition. Although it seems inevitable that some errors will 
still have escaped identification, it is clear that many, many more would have been missed
were it not for the vigilance of many friends and colleagues. In this context, we would
particularly like to acknowledge the input of Piet Brouwer, Christoph Bruder, Chung-Pin Chou,
Jan von Delft, Karin Everschor, Andrej Fischer, Alex Gezerlis, Sven Gnutzmann, Colin
Kiegel, Tobias Luck, Patrick Neven, Achim Rosch, Max Schafer, Matthias Sitte, Nobuhiko
Taniguchi, and Matthias Vojta.




1 

From particles to fields 



To introduce some fundamental concepts of field theory, we begin by considering two simple model 
systems - a one-dimensional “caricature” of a solid, and a freely propagating electromagnetic wave.
As well as exemplifying the transition from discrete to continuous degrees of freedom, these examples 
introduce the basic formalism of classical and quantum field theory, the notion of elementary excitations, 
collective modes, symmetries, and universality - concepts which will pervade the rest of the text. 



One of the more remarkable facts about condensed matter physics is that phenomenology of 
fantastic complexity is born out of a Hamiltonian of comparative simplicity. Indeed, it is not 
difficult to construct microscopic “condensed matter Hamiltonians” of reasonable generality. 
For example, a prototypical metal or insulator might be described by the many-particle 
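Hamiltonian $H = H_e + H_i + H_{ei}$, where

$$
\begin{aligned}
H_e &= \sum_i \frac{\mathbf{p}_i^2}{2m} + \sum_{i<j} V_{ee}(\mathbf{r}_i - \mathbf{r}_j), \\
H_i &= \sum_I \frac{\mathbf{P}_I^2}{2M} + \sum_{I<J} V_{ii}(\mathbf{R}_I - \mathbf{R}_J), \\
H_{ei} &= \sum_{iI} V_{ei}(\mathbf{R}_I - \mathbf{r}_i).
\end{aligned}
\tag{1.1}
$$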

Here, $\mathbf{r}_i$ ($\mathbf{R}_I$) denote the coordinates of the valence electrons (ion cores) and $H_e$, $H_i$, and $H_{ei}$
describe the dynamics of electrons, ions, and the interaction of electrons and ions, respectively
(see Fig. 1.1). Of course, the Hamiltonian Eq. (1.1) can be made “more realistic,” for
example by remembering that electrons and ions carry spin, adding disorder, or introduc- 
ing host lattices with multi-atomic unit-cells. However, for developing our present line of 
thought the prototype H will suffice. 
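
To make the three-part structure of Eq. (1.1) concrete, the following is a minimal numerical sketch that evaluates the classical energy of a one-dimensional configuration. It is only an illustration under stated assumptions: the softened Coulomb form `soft_coulomb`, its parameter `a`, the unit choices, the mass ratio `M=1836.0`, and all function names are hypothetical and are not taken from the text, which leaves the potentials unspecified.

```python
import numpy as np

def soft_coulomb(x, strength, a=0.5):
    # Hypothetical softened 1D Coulomb form, strength / sqrt(x^2 + a^2).
    # This is an assumption for illustration, not a potential from the text.
    return strength / np.sqrt(x**2 + a**2)

def pair_sum(x, potential):
    # Sum potential(x_i - x_j) over distinct pairs i < j.
    i, j = np.triu_indices(len(x), k=1)
    return float(np.sum(potential(x[i] - x[j])))

def prototype_energy(r, p, R, P, m=1.0, M=1836.0):
    """Classical energy H = H_e + H_i + H_ei of a 1D configuration.

    r, p: electron coordinates and momenta; R, P: ion coordinates and momenta.
    Masses m, M are in assumed units with the electron mass set to 1.
    """
    v_ee = lambda x: soft_coulomb(x, +1.0)  # electron-electron repulsion
    v_ii = lambda x: soft_coulomb(x, +1.0)  # ion-ion repulsion
    v_ei = lambda x: soft_coulomb(x, -1.0)  # electron-ion attraction
    H_e = float(np.sum(p**2)) / (2 * m) + pair_sum(r, v_ee)
    H_i = float(np.sum(P**2)) / (2 * M) + pair_sum(R, v_ii)
    # Electron-ion term: sum over all (i, I) pairs of V_ei(R_I - r_i).
    H_ei = float(np.sum(v_ei(R[None, :] - r[:, None])))
    return H_e + H_i + H_ei

rng = np.random.default_rng(0)
R = np.arange(4, dtype=float)          # ions on a unit-spaced chain
r = R + rng.normal(0.0, 0.3, size=4)   # electrons scattered near the ions
p = rng.normal(size=4)                 # arbitrary electron momenta
P = np.zeros(4)                        # ions at rest

E = prototype_energy(r, p, R, P)
# Shift every coordinate by the same amount; the energy should not change.
E_shifted = prototype_energy(r + 2.7, p, R + 2.7, P)
print(abs(E - E_shifted) < 1e-9)
```

Because every potential here depends only on coordinate differences, shifting all electron and ion positions by the same amount leaves the energy unchanged up to rounding, which is the translational invariance of the prototype Hamiltonian in its simplest form.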

The fact that a seemingly innocuous Hamiltonian like Eq. (1.1) is capable of generating the 
vast panopticon of metallic phenomenology can be read in reverse order: one will normally 
not be able to make theoretical progress by approaching the problem in an “ab initio”
manner, i.e. by an approach that treats all microscopic constituents as equally relevant 
degrees of freedom. 

How then can successful analytical approaches be developed? The answer to this question 
lies in a number of basic principles inherent in generic condensed matter systems. 







Figure 1.1 A one-dimensional cartoon of a (metallic) solid. Positively charged ions located at
positions $R_I$ are surrounded by a conduction electron cloud (electron coordinates denoted by $r_i$).
While the motion of the ions is massively constrained by the lattice potential $V_{ii}$ (indicated by the
solid line and its harmonic approximation shown dashed), the dynamics of the electrons is affected
by their mutual interaction ($V_{ee}$) and their interaction with the core ions ($V_{ei}$).

1. Structural reducibility: Not all components of the Hamiltonian (1.1) need to be treated 
simultaneously. For example, when the interest is foremost in the vibrational motion of 
the ion lattice, the dynamics of the electron system can often be neglected or, at least, 
be treated in a simplistic manner. Similarly, much of the character of the dynamics of 
the electrons is independent of the ion lattice, etc. 

2. In the majority of condensed matter applications, one is interested not so much in the 
full profile of a given system, but rather in its energetically low-lying dynamics. This is 
motivated partly by practical aspects (in daily life, iron is normally encountered at room 
temperature and not at its melting point), and partly by the tendency of large systems 
to behave in a “universal” manner at low temperatures. Here universality implies that 
systems differing in microscopic detail (e.g. different types of interaction potentials, ion 
species, etc.) exhibit common collective behavior. As a physicist, one will normally seek
unifying principles in collective phenomena rather than describe the peculiarities of
individual species. However, universality is equally important in the practice of condensed
matter theory. It implies, for example, that, at low temperatures, details of the functional 
form of microscopic interaction potentials are of secondary importance, i.e. that one may 
employ simple model Hamiltonians. 

3. For most systems of interest, the number of degrees of freedom is formidably large with 
$N = O(10^{23})$. However, contrary to first impressions, the magnitude of this figure is
rather an advantage. The reason is that in addressing condensed matter problems we 
may make use of the concepts of statistics and that (precisely due to the largeness of 
N) statistical errors tend to be negligibly small. 1 

4. Finally, condensed matter systems typically possess a number of intrinsic symmetries. 
For example, our prototype Hamiltonian above is invariant under simultaneous trans- 
lation and rotation of all coordinates, which expresses the global Galilean invariance of 
the system (a continuous set of symmetries). Spin rotation invariance (continuous) and 



1 The importance of this point is illustrated by the empirical observation that the most challenging systems in
physical sciences are of medium (and not large) scale: metallic clusters, medium-sized nuclei, or large atoms, for example,
consist of $O(10^{1}\text{-}10^{2})$ fundamental constituents. Such problems are well beyond the reach of few-body quantum
mechanics while not yet accessible to reliable statistical modeling. Often the only viable path to approaching
systems of this type is massive use of phenomenology.







time-reversal invariance (discrete) are other examples of frequently encountered sym-
metries. The general importance of symmetries cannot be overemphasized: symmetries
entail the conservation laws that simplify any problem. Yet in condensed matter physics,
symmetries are “even more” important. A conserved observable is generally tied to an
energetically low-lying excitation. In the universal low-temperature regimes in which we
will typically be interested, it is precisely the dynamics of these low-level excitations that
governs the gross behavior of the system. In subsequent sections, the sequence “symmetry
→ conservation law → low-lying excitations” will be encountered time and again. At
any rate, identification of the fundamental symmetries will typically be the first step in
the analysis of a solid state system.

To understand how these basic principles can be used to formulate and explore “effective 
low-energy” field theories of solid state systems we will begin our discussion by focussing on 
the harmonic chain: a collection of atoms bound by a harmonic potential. In doing so, we
will observe that the universal characteristics encapsulated by the low-energy dynamics 2 of 
large systems relate naturally to concepts of field theory. 



1.1 Classical harmonic chain: phonons 

Returning to the prototype Hamiltonian (1.1) discussed earlier, let us focus on the dynamical 
properties of the positively charged core ions that constitute the host lattice of a crystal. 
For the moment, let us neglect the fact that atoms are quantum objects and treat the ions 
as classical entities. To further simplify the problem, let us consider an atomic chain rather 
than a generic d-dimensional solid. In this case, the positions of the ions can be specified 
by a sequence of coordinates with an average lattice spacing a. Relying on the reduction 
principle (1) we will first argue that, to understand the behavior of the ions, the dynamics 
of the conduction electrons are of secondary importance, i.e. we will set $H_e = H_{ei} = 0$.

At strictly zero temperature, the system of ions will be frozen out, i.e. the one-dimensional 
ion coordinates $R_I \equiv \bar{R}_I = Ia$ settle into a regularly spaced array. Any deviation from
a perfectly regular configuration has to be paid for by a price in potential energy. For 
low enough temperatures (principle 2), this energy will be approximately quadratic in the 
small deviation from the equilibrium position. The reduced low-energy Hamiltonian of 
our system then reads 



$$H = \sum_{I=1}^{N} \left[ \frac{P_I^2}{2M} + \frac{k_s}{2} (R_{I+1} - R_I - a)^2 \right], \qquad (1.2)$$



where the coefficient $k_s$ determines the steepness of the lattice potential. Notice that $H$ can
be interpreted as the Hamiltonian of $N$ point-like particles of mass $M$ elastically connected
by springs with spring constant $k_s$ (see Fig. 1.2).



2 In this text, we will focus on the dynamical behavior of large systems, as opposed to their static structural
properties. In particular, we will not address questions related to the formation of definite crystallographic
structures in solid state systems.







From particles to fields 



Figure 1.2 Toy model of a one-dimensional solid: a chain of elastically bound massive point parti-
cles (mass $M$, spring constant $k_s$) with equilibrium positions $(I-1)a$, $Ia$, $(I+1)a$.



Lagrangian formulation and equations of motion 

What are the elementary low- 
energy excitations of the system? To 
answer this question we might, in 
principle, attempt to solve Hamil- 
ton’s equations of motion. Indeed, 
since H is quadratic in all coordi- 
nates, such a program is, in this 
case, feasible. However, we must bear 
in mind that few of the problems 
encountered in general solid state 
physics enjoy this property. Further, 
it seems unlikely that the low-energy dynamics of a macroscopically large chain - which 
we know from our experience will be governed by large-scale wave-like excitations - is 
adequately described in terms of an “atomistic” language; the relevant degrees of freedom 
will be of a different type. We should, rather, draw on the basic principles 1-4 set out 
above. Notably, we have so far paid attention neither to the intrinsic symmetry of the 
problem nor to the fact that N is large. 

Crucially, to reduce a microscopic model to an effective low-energy theory, the Hamil- 
tonian is often not a very convenient starting point. Usually, it is more efficient to start 
out from an action. In the present case, the Lagrangian action corresponding to a time
interval $[0, t_0]$ is defined as $S = \int_0^{t_0} dt\, L(R, \dot{R})$, where $(R, \dot{R}) \equiv \{R_I, \dot{R}_I\}$ symbolically
represents the set of all coordinates and their time derivatives. The Lagrangian $L$ related to
the Hamiltonian (1.2) is given by



Joseph-Louis Lagrange 1736-1813

A mathematician who excelled in all fields of analysis, number theory, and celestial mechanics.
In 1788 he published Mécanique Analytique, which summarised all the work done in
the field of mechanics since the time of Newton, and is notable for its use of the theory
of differential equations. In it he transformed mechanics into a branch of mathematical analysis.




$$L = T - U = \sum_{I=1}^{N} \left[ \frac{M \dot{R}_I^2}{2} - \frac{k_s}{2} (R_{I+1} - R_I - a)^2 \right], \qquad (1.3)$$



where T and U denote respectively the kinetic and potential energy. 

Since we are interested in the properties of the large-$N$ system, we can expect boundary
effects to be negligible. This being so, we are at liberty to impose on our atomic chain the
topology of a circle, i.e. we adopt periodic boundary conditions identifying $R_{N+1} = R_1$.
Further, anticipating that the effect of lattice vibrations on the solid is weak (i.e. long- 
range atomic order is maintained) we may assume that the deviation of the ions from their 








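equilibrium position is small ($|R_I(t) - \bar{R}_I| \ll a$), and the integrity of the solid is maintained.
With $R_I(t) = \bar{R}_I + \phi_I(t)$ ($\phi_{N+1} = \phi_1$) the Lagrangian (1.3) assumes the simplified form

$$L = \sum_{I=1}^{N} \left[ \frac{M}{2} \dot{\phi}_I^2 - \frac{k_s}{2} (\phi_{I+1} - \phi_I)^2 \right].$$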
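
As a cross-check that is not part of the original argument, the discrete Lagrange equations of the chain can be written down directly from the simplified Lagrangian $L = \sum_I [\frac{M}{2}\dot{\phi}_I^2 - \frac{k_s}{2}(\phi_{I+1} - \phi_I)^2]$. The sketch below assumes periodic boundary conditions, so that every site $I$ has two neighbors:

```latex
\begin{align*}
\frac{\partial L}{\partial \dot{\phi}_I} &= M \dot{\phi}_I, \\
\frac{\partial L}{\partial \phi_I} &= k_s (\phi_{I+1} - \phi_I) - k_s (\phi_I - \phi_{I-1}), \\
\frac{d}{dt} \frac{\partial L}{\partial \dot{\phi}_I} - \frac{\partial L}{\partial \phi_I} = 0
\quad &\Longrightarrow \quad
M \ddot{\phi}_I = k_s \left( \phi_{I+1} - 2\phi_I + \phi_{I-1} \right).
\end{align*}
```

The right-hand side is a lattice second difference, which already hints at the second spatial derivative that appears once the chain is described by a smooth function.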

To make further progress, we will now make use of the fact that we are not concerned with 
the behavior of our system on “atomic” scales. (In any case, for such purposes a modeling 
like the one above would be much too primitive!) Rather, we are interested in experimentally 
observable behavior that manifests itself on macroscopic length scales (principle 2). For 
example, one might wish to study the specific heat of the solid in the limit of infinitely 
many atoms (or at least a macroscopically large number, $O(10^{23})$). Under these conditions,
microscopic models can usually be substantially simplified (principle 3). In particular, it is 
often permissible to subject a discrete lattice model to a so-called continuum limit, i.e. 
to neglect the discreteness of the microscopic entities and to describe the system in terms 
of effective continuum degrees of freedom. 

In the present case, taking a continuum limit amounts to describing the lattice fluctuations $\phi_I$
in terms of smooth functions of a continuous variable $x$ (see the figure where the [horizontal]
displacement of the point particles has been plotted along the vertical). Clearly such a
description makes sense only if relative fluctuations on atomic scales are weak (for otherwise
the smoothness condition would be violated). However, if this condition is met - as it will be
for sufficiently large values of the stiffness constant $k_s$ - the continuum description is much
more powerful than the discrete encoding in terms of the “vector” $\{\phi_I\}$. All steps that we will
need to take to go from the Lagrangian to concrete physical predictions will be much easier to formulate.

Introducing continuum degrees of freedom $\phi(x)$, and applying a first-order Taylor
expansion,$^3$ let us define

$$\phi_I \to a^{1/2} \phi(x)\big|_{x=Ia}, \qquad \phi_{I+1} - \phi_I \to a^{3/2} \partial_x \phi(x)\big|_{x=Ia}, \qquad \sum_{I=1}^{N} \to \frac{1}{a} \int_0^L dx,$$

where $L = Na$. Note that, as defined, the functions $\phi(x, t)$ have dimensionality [length]$^{1/2}$.
Expressed in terms of the new degrees of freedom, the continuum limit of the Lagrangian
then reads

$$L[\phi] = \int_0^L dx\, \mathcal{L}(\phi, \partial_x \phi, \dot{\phi}), \qquad \mathcal{L}(\phi, \partial_x \phi, \dot{\phi}) = \frac{m}{2} \dot{\phi}^2 - \frac{k_s a^2}{2} (\partial_x \phi)^2, \qquad (1.4)$$




3 Indeed, for reasons that will become clear, higher-order contributions to the Taylor expansion are immaterial in
the long-range continuum limit.








Figure 1.3 Schematic visualization of a field: a mapping $\phi$ from a base manifold $M$ into a target
space $T$ (in this case, $T$ is the set of real numbers; but, in general, $T$ can be more complicated). A
functional assigns to each $\phi$ a real number $S[\phi]$. The grid embedded into $M$ indicates that fields in
condensed matter physics arise as continuum limits of discrete mappings.

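where the Lagrangian density $\mathcal{L}$ has dimensionality [energy]/[length] and we have designated
the particle mass by the more common symbol $m \equiv M$. Similarly, the classical action
assumes the continuum form

$$S[\phi] = \int dt\, L[\phi] = \int dt \int_0^L dx\, \mathcal{L}(\phi, \partial_x \phi, \dot{\phi}). \qquad (1.5)$$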



We have thus succeeded in abandoning the $N$-point particle description in favor of one
involving continuous degrees of freedom, a (classical) field. The dynamics of the latter 
are specified by the functionals L and S, which represent the continuum generalizations 
of the discrete classical Lagrangian and action, respectively. 
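
The continuum limit can be checked numerically. The following minimal sketch compares the discrete potential energy $\sum_I \frac{k_s}{2}(\phi_{I+1} - \phi_I)^2$ with its continuum counterpart $\int_0^L dx\, \frac{k_s a^2}{2}(\partial_x \phi)^2$ for a smooth, long-wavelength profile. The parameter values, the sinusoidal profile, and all function names are assumptions made for illustration; they do not appear in the text.

```python
import math

def discrete_potential(phi_lattice, ks):
    """U = sum_I (ks/2) (phi_{I+1} - phi_I)^2 with periodic boundary conditions."""
    n = len(phi_lattice)
    return sum(0.5 * ks * (phi_lattice[(i + 1) % n] - phi_lattice[i]) ** 2
               for i in range(n))

def continuum_potential(dphi_dx, ks, a, length, samples=20000):
    """U = int_0^L dx (ks a^2 / 2) (d_x phi)^2, evaluated with the midpoint rule."""
    h = length / samples
    return sum(0.5 * ks * a ** 2 * dphi_dx((j + 0.5) * h) ** 2
               for j in range(samples)) * h

# Hypothetical parameters, chosen for illustration only.
N, a, ks, amp = 400, 0.5, 3.0, 0.1
L = N * a
q = 2.0 * math.pi / L              # longest wavelength compatible with periodicity

def phi(x):
    """Assumed smooth field profile phi(x)."""
    return amp * math.sin(q * x)

def dphi(x):
    """Spatial derivative of the assumed profile."""
    return amp * q * math.cos(q * x)

# Rescaling phi_I = a^{1/2} phi(x = I a), as in the text.
lattice = [math.sqrt(a) * phi(i * a) for i in range(N)]

u_discrete = discrete_potential(lattice, ks)
u_continuum = continuum_potential(dphi, ks, a, L)
relative_gap = abs(u_discrete - u_continuum) / u_continuum
print(relative_gap)
```

For a profile that varies on the scale of the whole chain the two expressions agree closely; shortening the wavelength toward the lattice spacing makes the gap grow, which is the regime where the smoothness assumption fails.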

INFO The continuum variable $\phi$ is our first encounter with a field. Before proceeding with
our example, let us pause to make some preliminary remarks on the general definition of these
objects. This will help to place the subsequent discussion of the atomic chain into a broader
context. Formally, a field is a smooth mapping

$$\phi: M \to T, \qquad z \mapsto \phi(z),$$

from a certain manifold $M$,$^4$ often called the “base manifold,” into a “target” or “field manifold”
$T$ (see Fig. 1.3).$^5$ In our present example, $M = [0, L] \times [0, t] \subset \mathbb{R}^2$ is the product of intervals in
space and time, and $T = \mathbb{R}$. In fact, the factorization $M \subset \mathcal{R} \times \mathcal{T}$ into a space-like manifold
$\mathcal{R}$ multiplied by a one-dimensional time-like manifold $\mathcal{T}$ is inherent in most applications of
condensed matter physics.$^6$




4 If you are unfamiliar with the notion of manifolds (for a crash course, see page 537), think of M and T as subsets 
of some vector space. For the moment, this limitation won’t do any harm. 

5 In some (rare) cases it becomes necessary to define fields in a more general sense (e.g. as sections of mathematical 
objects known as fiber bundles). However, in practically all condensed matter applications the more restrictive 
definition above will suffice. 

6 By contrast, the condition of Lorentz invariance implies the absence of such factorizations in relativistic field 
theory. In classical statistical field theories, i.e. theories probing the thermodynamic behavior of large systems, 
M is just space-like. 








However, the individual factors $\mathcal{R}$ and $\mathcal{T}$ may, of course, be more complex than in our
prototypical problem above. As to the target manifold, not much can be said in general; depending
on the application, the realizations of T range from real or complex numbers over vector spaces 
and groups to the “fanciest objects” of mathematical physics. 

In applied field theory, fields appear not as final objects but rather as input to functionals (see
Fig. 1.3). Mathematically, a functional $S: \phi \mapsto S[\phi] \in \mathbb{R}$ is a mapping that takes a field as its
argument and maps it into the real numbers. The functional profile $S[\phi]$ essentially determines
the character of a field theory. Notice that the argument of a functional is commonly indicated
in square brackets [ ].

While these formulations may appear unnecessarily abstract, remembering the mathematical 
backbone of the theory often helps to avoid confusion. At any rate, it takes some time and practice 
to get used to the concept of fields and functionals. Conceptual difficulties in handling these 
objects can be overcome by remembering that any field in condensed matter physics arises as 
the limit of a discrete mapping. In the present example, the field $\phi(x)$ is obtained as a continuum
approximation of the discrete vector $\{\phi_I\} \in \mathbb{R}^N$; the functional $L[\phi]$ is the continuum limit of
the function $L: \mathbb{R}^N \to \mathbb{R}$, etc. While in practical calculations fields are usually easier to handle
than their discrete analogs, it is sometimes helpful to think about problems of field theory in 
a discrete language. Within the discrete picture, the mathematical apparatus of field theory 
reduces to finite-dimensional calculus. 



Although Eq. (1.4) contains the full information about the model, we have not yet learned 
much about its actual behavior. To extract concrete physical information from Eq. (1.4) we 
need to derive equations of motion. At first sight, it may not be entirely clear what is 
meant by the term “equations of motion” in the context of an infinite-dimensional model: 
the equations of motion relevant for the present problem are obtained as the generalization
of the conventional Lagrange equations of $N$-particle classical mechanics to a model
with infinitely many degrees of freedom. To derive these equations we need to generalize 
Hamilton’s extremal principle (i.e. the route from an action to the associated equations of 
motion) to infinite dimensions. As a warm-up, let us briefly recapitulate how the extremal 
principle works for a system with one degree of freedom. 

Suppose the dynamics of a classical point particle with coordinate $x(t)$ is described by the
classical Lagrangian $L(x, \dot{x})$, and action $S[x] = \int dt\, L(x, \dot{x})$. Hamilton’s extremal principle
states that the configurations $x(t)$ that are actually realized are those that extremize
the action, $\delta S[x] = 0$. This means (for a substantiated discussion, see Section 1.2 below)
that, for any smooth curve $t \mapsto y(t)$,

$$\lim_{\epsilon \to 0} \frac{1}{\epsilon} \left( S[x + \epsilon y] - S[x] \right) = 0. \qquad (1.6)$$

To first order in $\epsilon$, the action has to remain invariant. Applying this condition, one finds
that it is fulfilled if and only if $x$ satisfies Lagrange’s equation of motion

$$\frac{d}{dt} \left( \partial_{\dot{x}} L \right) - \partial_x L = 0. \qquad (1.7)$$



Recapitulate the derivation of (1.7) from the classical action. 
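
The extremal condition (1.6) can be probed numerically on a time grid. The sketch below assumes a harmonic oscillator Lagrangian $L = \frac{m}{2}\dot{x}^2 - \frac{k}{2}x^2$ with $m = k = 1$ (a choice made here for illustration, not taken from the text), discretizes the action, and compares the first-order variation around the classical path $x(t) = \cos t$ with that around a straight line sharing the same endpoints. The variation $y(t)$ is taken to vanish at both ends of the time interval.

```python
import math

def action(path, dt, m=1.0, k=1.0):
    """Discretized S = sum dt [ m/2 (dx/dt)^2 - k/2 x^2 ], using segment midpoints."""
    total = 0.0
    for x0, x1 in zip(path[:-1], path[1:]):
        velocity = (x1 - x0) / dt
        midpoint = 0.5 * (x0 + x1)
        total += dt * (0.5 * m * velocity ** 2 - 0.5 * k * midpoint ** 2)
    return total

def first_variation(path, bump, dt, eps=1e-3):
    """Symmetric estimate of lim (S[x + eps*y] - S[x]) / eps."""
    plus = [x + eps * y for x, y in zip(path, bump)]
    minus = [x - eps * y for x, y in zip(path, bump)]
    return (action(plus, dt) - action(minus, dt)) / (2.0 * eps)

steps, t_final = 1000, 1.0
dt = t_final / steps
times = [n * dt for n in range(steps + 1)]

# Classical path: solves m x'' = -k x for m = k = 1.
classical = [math.cos(t) for t in times]
# Comparison path: straight line with the same endpoints (not a solution).
straight = [1.0 + (math.cos(t_final) - 1.0) * t / t_final for t in times]
# Variation y(t), vanishing at t = 0 and t = t_final.
bump = [math.sin(math.pi * t / t_final) for t in times]

dS_classical = first_variation(classical, bump, dt)
dS_straight = first_variation(straight, bump, dt)
print(dS_classical, dS_straight)
```

Up to discretization error, the first-order variation vanishes only for the path that satisfies the equation of motion, in line with the statement that (1.6) and (1.7) are equivalent.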









Figure 1.4 Schematic showing the variation of the field associated with the action functional.
Notice that the variation $\epsilon\eta$ is supposed to vanish on the boundaries of the base $M = [0, L] \times [0, t]$.



In Eq. (1.5) we are dealing with a system of infinitely many degrees of freedom, $\phi(x, t)$. Yet
Hamilton’s principle is general and we may see what happens if Eq. (1.5) is subjected to an
extremal principle analogous to Eq. (1.6). To do so, we substitute $\phi(x, t) \to \phi(x, t) + \epsilon\eta(x, t)$
into Eq. (1.5) and require vanishing of the first-order contribution to an expansion in $\epsilon$
(see Fig. 1.4). When applied to the specific Lagrangian (1.4), substituting the “varied” field
leads to

$$S[\phi + \epsilon\eta] = S[\phi] + \epsilon \int dt \int_0^L dx \left( m \dot{\phi} \dot{\eta} - k_s a^2\, \partial_x \phi\, \partial_x \eta \right) + O(\epsilon^2).$$

Integrating by parts and requiring that the contribution linear in $\epsilon$ vanishes, one obtains

$$\lim_{\epsilon \to 0} \frac{1}{\epsilon} \left( S[\phi + \epsilon\eta] - S[\phi] \right) = -\int dt \int_0^L dx \left( m \ddot{\phi} - k_s a^2 \partial_x^2 \phi \right) \eta = 0.$$

(Notice that the boundary terms vanish identically.) Now, since $\eta$ was defined to be an
arbitrary smooth function, the integral above can vanish only if the factor in parentheses is
globally vanishing. Thus the equation of motion takes the form of a wave equation

$$\left( m \partial_t^2 - k_s a^2 \partial_x^2 \right) \phi = 0. \qquad (1.8)$$



The solutions of Eq. (1.8) have the general form $\phi_+(x - vt) + \phi_-(x + vt)$ where $v = a\sqrt{k_s/m}$, and
$\phi_\pm$ are arbitrary smooth functions of the argument. From this we can deduce that the basic
low-energy elementary excitations of our model are lattice vibrations propagating as sound
waves to the left or right at a constant velocity $v$ (see the figure).$^7$ The trivial behavior of
our model is of course a direct consequence of its simplistic definition - no dissipation,
dispersion, or other non-trivial ingredients. Adding these refinements leads to the general
classical theory of lattice vibrations (such as that described in the text by Ashcroft and
Mermin$^8$). Finally, notice that





7 Strictly speaking, the modeling of our system enforces a periodicity constraint $\phi_\pm(x + L) = \phi_\pm(x)$. However, in
the limit of a large system, this aspect becomes inessential.

8 N. W. Ashcroft and N. D. Mermin, Solid State Physics (Holt-Saunders International, 1983).








the elementary excitations of the chain have little in common with its “microscopic” con- 
stituents (the atomic oscillators). Rather they are collective excitations, i.e. elementary 
excitations comprising a macroscopically large number of microscopic degrees of freedom. 
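
The traveling-wave form of the solution can be verified with finite differences. The sketch below assumes a Gaussian profile for $\phi_+$ and arbitrary illustrative values for $m$, $k_s$, and $a$; it estimates the residual of the wave equation $(m\partial_t^2 - k_s a^2 \partial_x^2)\phi = 0$ at a few sample points and tracks where the peak of the pulse sits at a later time.

```python
import math

m, ks, a = 2.0, 3.0, 0.5          # hypothetical parameter values
v = a * math.sqrt(ks / m)         # sound velocity v = a sqrt(ks / m)

def pulse(s):
    """An arbitrary smooth profile phi_+(s); a Gaussian is assumed here."""
    return math.exp(-0.5 * s * s)

def phi(x, t):
    """Right-moving solution phi_+(x - v t)."""
    return pulse(x - v * t)

def wave_residual(x, t, h=1e-3):
    """Central-difference estimate of m d_t^2 phi - ks a^2 d_x^2 phi."""
    phi_tt = (phi(x, t + h) - 2.0 * phi(x, t) + phi(x, t - h)) / h ** 2
    phi_xx = (phi(x + h, t) - 2.0 * phi(x, t) + phi(x - h, t)) / h ** 2
    return m * phi_tt - ks * a ** 2 * phi_xx

residuals = [abs(wave_residual(x, t)) for x in (-1.0, 0.3, 2.0) for t in (0.0, 1.5)]

# Locate the maximum of the pulse at t = 4 on a grid of spacing 0.01.
peak_position = max((i * 0.01 for i in range(-500, 501)), key=lambda x: phi(x, 4.0))
print(max(residuals), peak_position, v * 4.0)
```

The residual is small compared with the individual terms, and the peak has moved by approximately $vt$, consistent with propagation at constant velocity without change of shape.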

INFO The “relevant” excitations of a condensed matter system can, but need not, be 
of collective type. For example, the interacting electron gas (a system to be discussed in detail 
below) supports microscopic excitations - charged quasi-particles standing in 1:1 correspondence 
with the electrons of the original microscopic system - while the collective excitations are plasmon 
modes of large wavelength and engaging many electrons. Typically, the nature of the fundamental 
excitations cannot be straightforwardly inferred from the microscopic definition of a model. 
Indeed, the mere identification of the relevant excitations often represents the most important 
step in the solution of a condensed matter problem. 



Hamiltonian formulation 

An important characteristic of any 
excitation is its energy. How much 
energy is stored in the sound waves 
of the harmonic chain? To address 
this question, we need to switch back 
to a Hamiltonian formulation. Once 
again, this is achieved by general- 
izing standard manipulations from 
point mechanics to the continuum. 

Remembering that, for a Lagrangian
of a point particle, $p \equiv \partial_{\dot{x}} L$ is the momentum conjugate to the coordinate $x$, let us consider
the Lagrangian density and define$^9$



Sir William Rowan Hamilton 1805-65

A mathematician credited with the discovery of quaternions, the first non-commutative
algebra to be studied. He also invented important new methods in mechanics.
(Image from W. R. Hamilton, Collected Papers, vol. II, Cambridge University Press, 1940.)




$$\pi(x) \equiv \frac{\partial \mathcal{L}(\phi, \partial_x \phi, \dot{\phi})}{\partial \dot{\phi}(x)} \qquad (1.9)$$



as the canonical momentum associated with $\phi$ (at the point $x$). In common with $\phi$,
the momentum $\pi$ is a continuum degree of freedom. At each space point it may take an
independent value. Notice that $\pi(x)$ is nothing but the continuum generalization of the
lattice momentum $P_I$ of Eq. (1.2). (Applied to $P_I$, a continuum approximation like $\phi_I \to
\phi(x)$ would produce $\pi(x)$.) The Hamiltonian density is then defined as usual through the
Legendre transformation,



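$$\mathcal{H}(\phi, \partial_x \phi, \pi) = \left( \pi \dot{\phi} - \mathcal{L}(\phi, \partial_x \phi, \dot{\phi}) \right) \Big|_{\dot{\phi} = \dot{\phi}(\phi, \pi)}, \qquad (1.10)$$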



9 In field theory literature it is popular to denote the momentum by a Greek letter.









from where the full Hamiltonian is obtained as $H = \int_0^L dx\, \mathcal{H}$.

Verify that the transition $L \to H$ is a straightforward continuum generalization of
the Legendre transformation of the $N$-particle Lagrangian $L(\{\phi_I\}, \{\dot{\phi}_I\})$.



Having introduced a Hamiltonian, we are in a position to determine the energy of the sound
waves. Application of Eqs. (1.9) and (1.10) to the Lagrangian of the atomic chain yields
$\pi(x, t) = m\dot{\phi}(x, t)$ and

$$H[\pi, \phi] = \int dx \left( \frac{\pi^2}{2m} + \frac{k_s a^2}{2} (\partial_x \phi)^2 \right). \qquad (1.11)$$



Considering, say, a right-moving sound-wave excitation, $\phi(x, t) = \phi_+(x - vt)$, we find that
$\pi(x, t) = -mv\, \partial_x \phi_+(x - vt)$ and $H[\pi, \phi] = k_s a^2 \int dx\, [\partial_x \phi_+(x - vt)]^2 = k_s a^2 \int dx\, [\partial_x \phi_+(x)]^2$,
i.e. a positive definite, time-independent expression, as one would expect.
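
The time independence of the sound-wave energy can be illustrated numerically. This sketch assumes a Gaussian right-moving profile $\phi_+(s) = e^{-s^2/2}$ and illustrative values for $m$, $k_s$, and $a$ (none of which come from the text). It evaluates the energy functional (1.11) at two times and compares it with $k_s a^2 \int dx\, (\partial_x \phi_+)^2$, which for this particular profile equals $k_s a^2 \sqrt{\pi}/2$.

```python
import math

m, ks, a = 2.0, 3.0, 0.5           # hypothetical parameter values
v = a * math.sqrt(ks / m)

def dpulse(s):
    """Derivative of the assumed Gaussian profile phi_+(s) = exp(-s^2 / 2)."""
    return -s * math.exp(-0.5 * s * s)

def energy(t, x_min=-20.0, x_max=20.0, h=0.01):
    """Trapezoid estimate of H = int dx [ pi^2 / (2m) + (ks a^2 / 2) (d_x phi)^2 ]."""
    n = int(round((x_max - x_min) / h))
    total = 0.0
    for j in range(n + 1):
        x = x_min + j * h
        dphi = dpulse(x - v * t)        # d_x phi_+(x - v t)
        momentum = -m * v * dphi        # pi = m d_t phi = -m v d_x phi_+
        weight = 0.5 if j in (0, n) else 1.0
        total += weight * (momentum ** 2 / (2.0 * m) + 0.5 * ks * a ** 2 * dphi ** 2)
    return total * h

h_early, h_late = energy(0.0), energy(3.0)
# ks a^2 * int dx (d_x phi_+)^2, using int s^2 exp(-s^2) ds = sqrt(pi) / 2.
h_closed_form = ks * a ** 2 * math.sqrt(math.pi) / 2.0
print(h_early, h_late, h_closed_form)
```

The kinetic and potential contributions are equal for a purely right-moving wave, which is why the total is twice the potential term.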

Before proceeding, let us note an interesting feature of the energy functional: in the limit
of an infinitely shallow excitation, $\partial_x \phi_+ \to 0$, the energy vanishes. This sets the stage
for the last of the principles (4) hitherto unconsidered, symmetry. The Hamiltonian of
an atomic chain is invariant under simultaneous translation of all atom coordinates by a
fixed increment: $\phi_I \to \phi_I + \delta$, where $\delta$ is constant. This expresses the fact that a global
translation of the solid as a whole does not affect the internal energy. Now, the ground state
of any specific realization of the solid will be defined through a static array of atoms, each
located at a fixed coordinate $R_I = Ia \Rightarrow \phi_I = 0$. We say that the translational symmetry is
“spontaneously broken,” i.e. the solid has to decide where exactly it wants to rest. However,
spontaneous breakdown of a symmetry does not imply that the symmetry disappeared. On
the contrary, infinite-wavelength deviations from the pre-assigned ground state come close
to global translations of (macroscopically large portions of) the solid and, therefore, cost
a vanishingly small amount of energy. This is the reason for the vanishing of the sound
wave energy in the limit $\partial_x \phi \to 0$. It is also our first encounter with the aforementioned
phenomenon that symmetries lead to the formation of soft, i.e. low-energy, excitations. A
much more systematic exposition of these connections will be given in Chapter 6.

To conclude our discussion of the 
classical harmonic chain, let us con- 
sider the specific heat, a quantity 
directly accessible in experiment. A 
rough estimate of this quantity can 
be readily obtained from the micro- 
scopic harmonic Hamiltonian (1.2). 
According to the principles of statis- 
tical mechanics, the thermodynamic energy density is given by 



Ludwig Boltzmann 1844-1906 

A physicist whose greatest achievement was in the 
development of statistical mechanics, which explains 
and predicts how the properties of atoms (such as 
mass, charge, and structure) determine the visible 
properties of matter (such as viscosity, thermal con- 
ductivity, and diffusion). 



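$$u = \frac{1}{L} \frac{\int d\Gamma\, e^{-\beta H} H}{\int d\Gamma\, e^{-\beta H}} = -\frac{1}{L} \partial_\beta \ln Z,$$

where $\beta = 1/k_B T$, $Z \equiv \int d\Gamma\, e^{-\beta H}$ is the Boltzmann partition function, and the phase
space volume element is $d\Gamma = \prod_{I=1}^{N} dR_I\, dP_I$. We will set $k_B = 1$ throughout. The specific heat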








is then obtained as $c = \partial_T u$. To determine the temperature dependence of this quantity,
we make use of the fact that, upon rescaling of integration variables, $R_I \to \beta^{-1/2} X_I$,
$P_I \to \beta^{-1/2} Y_I$, the exponent $\beta H(R, P) \to H(X, Y)$ becomes independent of temperature
(a property that relies on the quadratic dependence of $H$ on both $R$ and $P$). The integration
measure transforms as $d\Gamma \to \beta^{-N} \prod_{I=1}^{N} dX_I\, dY_I \equiv \beta^{-N} d\Gamma'$. Expressed in terms of the
rescaled variables, one obtains the energy density

$$u = -\frac{1}{L} \partial_\beta \ln \left( \beta^{-N} K \right) = \rho T,$$

where $\rho = N/L$ is the density of atoms, and we have made use of the fact that the constant
$K \equiv \int d\Gamma'\, e^{-H(X, Y)}$ is independent of temperature. We thus find a temperature-independent
specific heat $c = \rho$. Notice that $c$ is fully universal, i.e. independent of the material
constants $M$ and $k_s$ determining $H$. (In fact, we could have anticipated this result from the
equipartition theorem of classical mechanics, i.e. the law that in a system with $N$ degrees
of freedom, the energy scales as $U = NT$.)
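
The universality of this result can be illustrated on the smallest building block: a single classical oscillator with one coordinate and one momentum. The sketch below is a simplification assumed here, not the full chain; it evaluates $\ln Z$ by numerical quadrature and obtains the mean energy from $-\partial_\beta \ln Z$ by central differences (with $k_B = 1$). The function names and parameter values are illustrative.

```python
import math

def gaussian_integral(coeff, beta, n=4001):
    """Trapezoid estimate of int dq exp(-beta * coeff * q^2) over the real line."""
    half_width = 12.0 / math.sqrt(2.0 * beta * coeff)   # twelve standard deviations
    h = 2.0 * half_width / (n - 1)
    total = 0.0
    for i in range(n):
        q = -half_width + i * h
        weight = 0.5 if i in (0, n - 1) else 1.0
        total += weight * math.exp(-beta * coeff * q * q)
    return total * h

def log_partition(beta, mass, ks):
    """ln Z for H = P^2 / (2 M) + (ks / 2) X^2; the two integrals factorize."""
    return (math.log(gaussian_integral(1.0 / (2.0 * mass), beta))
            + math.log(gaussian_integral(0.5 * ks, beta)))

def mean_energy(temperature, mass, ks, rel_step=1e-4):
    """Mean energy -d(ln Z)/d(beta) by central differences, with k_B = 1."""
    beta = 1.0 / temperature
    d_beta = rel_step * beta
    return -(log_partition(beta + d_beta, mass, ks)
             - log_partition(beta - d_beta, mass, ks)) / (2.0 * d_beta)

# Same temperature, very different (hypothetical) material constants.
energies = {(mass, ks): mean_energy(2.0, mass, ks)
            for mass, ks in [(1.0, 1.0), (5.0, 0.3), (0.2, 40.0)]}
print(energies)
```

Each choice of mass and spring constant returns a mean energy equal to the temperature, up to numerical error, so the derivative with respect to $T$ is a constant that does not depend on $M$ or $k_s$.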

How do these findings compare with experiment? Figure 1.5 shows the specific heat of
the insulating compound EuCoO$_3$.$^{10}$ For large temperatures, the specific heat approaches
a constant value, in accord with our analysis. However, for lower temperatures, substantial
deviations from $c = \text{const.}$ appear. Yet, this strong temperature dependence does not reflect
a failure of the simplistic microscopic modeling. Rather, the deviation is indicative of a 
quantum phenomenon. Indeed, we have so far totally neglected the quantum nature of 
the atomic oscillators. In the next chapter we will rectify this deficiency and discuss how 
the effective low-energy theory of the harmonic chain can be promoted to a quantum field 
theory. However, before proceeding with the development of the theory let us pause to 
introduce a number of mathematical concepts that surfaced above, in a way that survives 
generalization to richer problems. 



1.2 Functional analysis and variational principles 

Let us revisit the derivation of the equations of motion associated with the harmonic chain, 
Eq. (1.8). Although straightforward, neither was the calculation efficient, nor did it reveal 
general structures. In fact, what we did - expanding explicitly to first order in the variational
parameter $\epsilon$ - had the same status as evaluating derivatives by explicitly taking
limits: $f'(x) = \lim_{\epsilon \to 0} (f(x + \epsilon) - f(x))/\epsilon$. Moreover, the derivation made explicit use of
the particular form of the Lagrangian, thereby being of limited use with regard to a gen- 
eral understanding of the construction scheme. Given the importance attached to extremal 
principles in all of field theory, it is worthwhile investing some effort in constructing a more 
efficient scheme for the general variational analysis of continuum theories. In order to carry 
out this program we first need to introduce a mathematical tool of functional analysis, 
namely the concept of functional differentiation. 



10 In metals, the specific heat due to lattice vibrations exceeds the specific heat of the free conduction electrons
for temperatures larger than a few degrees kelvin.








Figure 1.5 Specific heat $c_p$ of the insulator EuCoO$_3$. At large temperatures, the specific heat starts
to approach a constant value, as predicted by the analysis of the classical harmonic chain. However,
for small temperatures, deviations from $c_p = \text{const.}$ are substantial. Such deviations can be ascribed
to quantum mechanical effects. (Courtesy of M. Kriener, A. Reichl, T. Lorenz, and A. Freimuth.)



In working with functionals, one is often concerned with how a given functional behaves
under (small) variations of its argument function. In particular, given a certain function $f$,
suspected to make a functional $F[f]$ stationary, one would like to find out whether, indeed,
the functional remains invariant under variations $f \to f + h$, where $h$ is an “infinitely small”
increment function. In ordinary analysis, questions of this type are commonly addressed by
exploring derivatives, i.e. we need to generalize the concept of a derivative to functionals.
This is achieved by the following definition: a functional $F$ is called differentiable if

$$F[f + \epsilon g] - F[f] = \epsilon \cdot DF_f[g] + O(\epsilon^2),$$

where the differential $DF_f$ is a linear functional (i.e. one with $DF_f[g_1 + g_2] = DF_f[g_1] +
DF_f[g_2]$), $\epsilon$ is a small parameter, and $g$ is an arbitrary function. The subscript indicates
that the differential generally depends on the “base argument” $f$. A functional $F$ is said to
be stationary on $f$ if and only if $DF_f = 0$.

In principle, the definition above answers our question concerning a stationarity condition. 
However, to make use of the definition, we still need to know how to compute the differential 
DF and how to relate the differentiability criterion to the concepts of ordinary calculus. In 
order to understand how answers to these questions can be systematically found, it is helpful 
to return temporarily to a discrete way of thinking, i.e. to interpret the argument $f$ of a
functional $F[f]$ as the limit $N \to \infty$ of a discrete vector $\mathbf{f} = \{f_n \equiv f(x_n), n = 1, \dots, N\}$,
where $\{x_n\}$ denotes a discretization of the support of $f$ (cf. the harmonic chain, $\phi \leftrightarrow f$).
Prior to taking the continuum limit, $N \to \infty$, $\mathbf{f}$ has the status of an $N$-dimensional vector
and $F(\mathbf{f})$ is a function defined over $N$-dimensional space. After the continuum limit, $\mathbf{f} \to f$
becomes a function itself and $F(\mathbf{f}) \to F[f]$ becomes a functional.

Now, within the discrete picture, it is clear how the variational behavior of functions is to
be analyzed. For example, the condition that, for all $\epsilon$ and all vectors $\mathbf{g}$, the linear expansion
of $F(\mathbf{f} + \epsilon\mathbf{g})$ ought to vanish is simply to say that the ordinary differential, $dF_{\mathbf{f}}$, defined








through 



$$F(\mathbf{f} + \epsilon\mathbf{g}) - F(\mathbf{f}) = \epsilon \cdot dF_{\mathbf{f}}(\mathbf{g}) + O(\epsilon^2),$$



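must be zero. In practice, one often expresses conditions of this type in terms of a certain
basis. In a Cartesian basis of $N$ unit vectors, $\mathbf{e}_n$, $n = 1, \dots, N$, $dF_{\mathbf{f}}(\mathbf{g}) \equiv (\nabla F_{\mathbf{f}}, \mathbf{g})$, where
$(\mathbf{f}, \mathbf{g}) \equiv \sum_{n=1}^{N} f_n g_n$ denotes the standard scalar product, and $\nabla F_{\mathbf{f}} = \{\partial_{f_n} F\}$ represents the
gradient with the partial derivative defined as

$$\partial_{f_n} F(\mathbf{f}) \equiv \lim_{\epsilon \to 0} \frac{1}{\epsilon} \left[ F(\mathbf{f} + \epsilon\mathbf{e}_n) - F(\mathbf{f}) \right]. \qquad (1.12)$$

From these identities, the differential is identified as

$$dF_{\mathbf{f}}(\mathbf{g}) = \sum_n \partial_{f_n} F(\mathbf{f})\, g_n. \qquad (1.13)$$

The vanishing of the differential amounts to vanishing of all partial derivatives $\partial_{f_n} F = 0$.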

Equations (1.12) and (1.13) can now be straightforwardly generalized to the continuum
limit whereupon the summation defining the finite-dimensional scalar product translates to
an integral,

$$(\mathbf{f}, \mathbf{g}) = \sum_{n=1}^{N} f_n g_n \;\to\; (f, g) = \int dx\, f(x) g(x).$$

The analog of the $n$th unit vector is a $\delta$-distribution, $\mathbf{e}_n \to \delta_x$, where $\delta_x(x') = \delta(x - x')$, as
can be seen from the following correspondence:

$$f_n = (\mathbf{f}, \mathbf{e}_n) = \sum_m f_m (\mathbf{e}_n)_m \;\to\; f(x) = (f, \delta_x) = \int dx'\, f(x') \delta_x(x').$$

Here $(\mathbf{e}_n)_m = \delta_{nm}$ denotes the $m$th component of the $n$th unit vector. The correspondence
(unit vector $\leftrightarrow$ $\delta$-distribution) is easy to memorize: while the components of $\mathbf{e}_n$ vanish, save
for the $n$th component, which equals unity, $\delta_x$ is a function that vanishes everywhere, save for
$x$ where it is infinite. That a unit component is replaced by “infinity” reflects the fact that
the support of the $\delta$-distribution is infinitely narrow; to obtain a unit-normalized integral
$\int \delta_x$, the function must be singular.

As a consequence of these identities, Eq. (1.13) translates to the continuum differential,

$$DF_f[g] = \int dx\, \frac{\delta F[f]}{\delta f(x)}\, g(x), \qquad (1.14)$$



where the generalization of the partial derivative,

$$\frac{\delta F[f]}{\delta f(x)} \equiv \lim_{\epsilon \to 0} \frac{1}{\epsilon} \left( F[f + \epsilon\delta_x] - F[f] \right), \qquad (1.15)$$



is commonly denoted by “$\delta$” instead of “$\partial$.” Equations (1.14) and (1.15) establish the conceptual
connection between ordinary and functional differentiation. Notice that we have
not yet learned how to calculate the differential practically, i.e. to evaluate expressions 
like Eq. (1.15) for concrete functionals. Nevertheless, the identities above are very use- 
ful, enabling us to generalize more-complex derivative operations of ordinary calculus by 







Table 1.1 Summary of basic definitions of discrete and continuum calculus.

Entity                  Discrete                                         Continuum
Argument                vector $\mathbf{f}$                              function $f$
Function(al)            multidimensional function $F(\mathbf{f})$        functional $F[f]$
Differential            $dF_{\mathbf{f}}(\mathbf{g})$                    $DF_f[g]$
Cartesian basis         $\mathbf{e}_n$                                   $\delta_x$
Scalar product ( , )    $\sum_n f_n g_n$                                 $\int dx\, f(x) g(x)$
“Partial derivative”    $\partial_{f_n} F(\mathbf{f})$                   $\delta F[f] / \delta f(x)$



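straightforward transcription. For example, the generalization of the standard chain rule, $\partial_{f_n} F(\mathbf{g}(\mathbf{f})) = \sum_m \partial_{g_m} F(\mathbf{g})\big|_{\mathbf{g}=\mathbf{g}(\mathbf{f})}\, \partial_{f_n} g_m(\mathbf{f})$, reads

$$\frac{\delta F[g[f]]}{\delta f(x)} = \int dy\, \frac{\delta F[g]}{\delta g(y)}\bigg|_{g=g[f]} \frac{\delta g(y)[f]}{\delta f(x)}.$$

Here g[f] is the continuum generalization of an $\mathbb{R}^m$-valued function, $\mathbf{g}: \mathbb{R}^n \to \mathbb{R}^m$, a function whose components g(y)[f] are functionals by themselves. Furthermore, given some functional F[f], we can construct its Taylor expansion as

$$F[f] = F[0] + \int dx_1\, \frac{\delta F[f]}{\delta f(x_1)}\bigg|_{f=0} f(x_1) + \frac{1}{2}\int dx_1 \int dx_2\, \frac{\delta^2 F[f]}{\delta f(x_2)\,\delta f(x_1)}\bigg|_{f=0} f(x_1) f(x_2) + \cdots,$$

where (exercise)

$$\frac{\delta^2 F[f]}{\delta f(x_2)\,\delta f(x_1)} = \lim_{\epsilon_{1,2} \to 0} \frac{1}{\epsilon_1 \epsilon_2}\left(F[f + \epsilon_1\delta_{x_1} + \epsilon_2\delta_{x_2}] - F[f + \epsilon_1\delta_{x_1}] - F[f + \epsilon_2\delta_{x_2}] + F[f]\right)$$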

generalizes a two-fold partial derivative. The validity of these identities can be made plau- 
sible by applying the transcription, Table 1.1, to the corresponding relations of standard 
calculus. To actually verify the formulae, one has to take the continuum limit of each step 
taken in the discrete variant of the corresponding proofs. At any rate, experience shows that 
it takes some time to get used to the concept of functional differentiation. However, after 
some practice it will become clear that this operation is not only extremely useful but also 
as easy to handle as conventional partial differentiation. 
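
The limit in Eq. (1.15) can be tried out numerically. The following is a minimal sketch, not part of the original text, assuming a uniform grid on which the δ-distribution is represented by a spike of height 1/dx (so that its Riemann-sum integral is unity). The test functional F[f] = ∫ dx f(x)², the grid, and all names are illustrative choices; for this functional the exact result is δF/δf(x) = 2f(x).

```python
# Sketch: functional derivative of F[f] = ∫ dx f(x)^2 evaluated on a grid.
# Assumption: delta_x is modeled as a grid spike of height 1/dx.

def F(f, dx):
    """Riemann-sum approximation of F[f] = ∫ dx f(x)^2."""
    return sum(v * v for v in f) * dx

def functional_derivative(functional, f, n, dx, eps=1e-6):
    """Finite-epsilon version of Eq. (1.15) at grid point n."""
    bumped = list(f)
    bumped[n] += eps / dx                      # f + eps * delta_x
    return (functional(bumped, dx) - functional(f, dx)) / eps

dx = 0.01
xs = [i * dx for i in range(101)]
f = [x * (1.0 - x) for x in xs]                # arbitrary smooth test function

n = 30
numeric = functional_derivative(F, f, n, dx)
exact = 2.0 * f[n]                             # analytic result 2 f(x)
```

For a finite ε the two numbers differ by a term of order ε/dx, which illustrates why the limit ε → 0 has to be taken before the grid is refined.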

We finally address the question of how to compute functional derivatives in practice. In doing so, we will make use of the fact that, in all but a few cases, the functionals encountered in field theory are of the structure

$$S[\phi] = \int_M d^m x\, \mathcal{L}(\phi^i, \partial_\mu \phi^i).$$ (1.16)

Here, we assume the base manifold M to be parameterized by an m-dimensional coordinate vector $x = \{x_\mu\}$. (In most practical applications, m = d + 1, and $x = (x_0, x_1, \ldots, x_d)$









contains one time-like component $x_0 = t$ and d space-like components $x_k$, k = 1, . . . , d.) We further assume that the field manifold has dimensionality n and that $\phi^i$, i = 1, . . . , n, are the coordinates of the field.

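What makes the functional $S[\phi]$ easy to handle is that all of its information is stored in the function $\mathcal{L}$. Owing to this simplification, the functional derivative can be related to an ordinary derivative of $\mathcal{L}$. To see this, all that we have to do is to evaluate the general definition Eq. (1.14) on the functional S:

$$\begin{aligned}
S[\phi + \epsilon\theta] - S[\phi] &= \int_M d^m x\, \left[\mathcal{L}(\phi + \epsilon\theta, \partial_\mu\phi + \epsilon\,\partial_\mu\theta) - \mathcal{L}(\phi, \partial_\mu\phi)\right] \\
&= \int_M d^m x\, \left[\frac{\partial\mathcal{L}}{\partial\phi^i}\,\theta^i + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i)}\,\partial_\mu\theta^i\right]\epsilon + \mathcal{O}(\epsilon^2) \\
&= \int_M d^m x\, \left[\frac{\partial\mathcal{L}}{\partial\phi^i} - \partial_\mu\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i)}\right]\theta^i\,\epsilon + \mathcal{O}(\epsilon^2),
\end{aligned}$$

where in the last line we have integrated by parts and assumed that the field variation vanishes on the boundary of the base manifold, $\theta|_{\partial M} = 0$. Comparison with Eq. (1.14) identifies the functional derivative as

$$\frac{\delta S[\phi]}{\delta\phi^i(x)} = \frac{\partial\mathcal{L}}{\partial\phi^i(x)} - \partial_\mu\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i(x))}.$$

We conclude that stationarity of the functional (1.16) is equivalent to the condition

$$\forall x, i: \qquad \frac{\partial\mathcal{L}}{\partial\phi^i(x)} - \partial_\mu\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i(x))} = 0.$$ (1.17)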

Equation (1.17) is known as the Euler-Lagrange equation of field theory. In fact, for d = 0 and $x_0 = t$, Eq. (1.17) reduces to the familiar Euler-Lagrange equation of a point particle in n-dimensional space. For d = 1 and $(x_0, x_1) = (t, x)$ we get back to the stationarity equations discussed in the previous section. In the next section we will apply the formalism to a higher-dimensional problem.
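
The identification of the functional derivative with the Euler-Lagrange expression can be checked on a simple example. The sketch below is an illustration rather than anything taken from the text: it assumes a one-dimensional base manifold [0, 1], the test density $\mathcal{L} = \frac{1}{2}(\partial_x\phi)^2 + \frac{1}{2}\phi^2$, and a uniform grid. For this choice the Euler-Lagrange expression is φ − φ″, which for φ = sin(πx) equals (1 + π²) sin(πx). A symmetric difference in ε is used in place of the one-sided limit of Eq. (1.15) to reduce the finite-ε error.

```python
import math

def action(phi, dx):
    """Discretized S[phi] = ∫ dx [ (phi')^2 / 2 + phi^2 / 2 ]."""
    kinetic = sum((phi[n + 1] - phi[n]) ** 2 for n in range(len(phi) - 1)) / (2.0 * dx)
    potential = sum(v * v for v in phi) * dx / 2.0
    return kinetic + potential

def functional_derivative(phi, n, dx, eps=1e-6):
    """Symmetric finite-epsilon estimate of dS/dphi(x_n).

    The grid spike eps/dx plays the role of eps * delta_x.
    """
    up = list(phi)
    down = list(phi)
    up[n] += eps / dx
    down[n] -= eps / dx
    return (action(up, dx) - action(down, dx)) / (2.0 * eps)

dx = 0.01
xs = [i * dx for i in range(101)]
phi = [math.sin(math.pi * x) for x in xs]      # vanishes at both boundaries

n = 25                                          # interior grid point x = 0.25
numeric = functional_derivative(phi, n, dx)
euler_lagrange = (1.0 + math.pi ** 2) * phi[n]  # phi - phi'' for phi = sin(pi x)
```

The remaining discrepancy between `numeric` and `euler_lagrange` is of order dx² and comes from the lattice approximation of the second derivative.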



1.3 Maxwell’s equations as a variational principle 

As a second example, let us consider the archetype of classical field theory, classical electro- 
dynamics. Indeed, as well as exemplifying the application of continuum variational principles 
on a problem with which we are all acquainted, this example illustrates the unifying poten- 
tial of the approach: that problems as different as the low-lying vibrational modes of a 
crystalline solid, and electrodynamics can be described by almost identical language indi- 
cates that we are dealing with a useful formalism. Specifically, our aim will be to explore 
how the equations of motion of electrodynamics, the inhomogeneous Maxwell equations, 

$$\nabla \cdot \mathbf{E} = \rho, \qquad \nabla \times \mathbf{B} - \partial_t \mathbf{E} = \mathbf{j},$$ (1.18)



11 Following standard conventions we denote space-like components by small Latin indices k = 1, . . . , d. In contrast, space-time indices are denoted by Greek indices μ = 0, . . . , d.








can be obtained from variational principles. (For simplicity, we restrict ourselves to a vacuum 
theory, i.e. E = D and B = H. Further, we have set the velocity of light to unity, c = 1. 
Within the framework of the variational principle, the homogeneous equations, 

$$\nabla \times \mathbf{E} + \partial_t \mathbf{B} = 0, \qquad \nabla \cdot \mathbf{B} = 0,$$ (1.19)



are regarded as ab initio constraints imposed on the “degrees of freedom” E and B.) 

To formulate Maxwell’s theory as a variational principle we require (1) a field formulated in a set of suitable “generalized coordinates,” and (2) its action. As to coordinates, the natural choice will be the coefficients of the electromagnetic (EM) 4-potential, $A_\mu = (\phi, -\mathbf{A})$, where $\phi$ is the scalar and $\mathbf{A}$ is the vector potential. The potential $A$ is unconstrained and uniquely determines the fields $\mathbf{E}$ and $\mathbf{B}$ through the standard equations $\mathbf{E} = -\nabla\phi - \partial_t\mathbf{A}$ and $\mathbf{B} = \nabla \times \mathbf{A}$. (In fact, the set of coordinates $A_\mu$ is “overly free” in the sense that gauge transformations $A_\mu \to A_\mu + \partial_\mu\Gamma$, where $\Gamma$ is an arbitrary function, leave the physical fields invariant. Later we will comment explicitly on this point.) The connection between $A$ and the physical fields can be expressed in a more symmetric way by introducing the EM field tensor,



James Clerk Maxwell 1831-79 

Scottish theoretical physicist and 
mathematician. Amongst many 
other achievements, he is credited 
with the formulation of the theory 
of electromagnetism, synthesizing 
all previous unrelated experiments 
and equations of electricity, mag- 
netism and optics into a consistent theory. (He is also 
known for creating the first true color photograph in 
1861.) 




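$$F = \{F_{\mu\nu}\} = \begin{pmatrix} 0 & E_1 & E_2 & E_3 \\ -E_1 & 0 & -B_3 & B_2 \\ -E_2 & B_3 & 0 & -B_1 \\ -E_3 & -B_2 & B_1 & 0 \end{pmatrix}.$$ (1.20)

The relation between fields and potential now reads $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$, where $x_\mu = (t, -\mathbf{x})$ and $\partial_\mu = (\partial_t, \nabla)$.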



EXERCISE Confirm that this connection follows from the definition of the vector potential. To verify that the constraint (1.19) is automatically included in the definition (1.20), compute the construct $\partial_\lambda F_{\mu\nu} + \partial_\mu F_{\nu\lambda} + \partial_\nu F_{\lambda\mu}$, where $(\lambda\mu\nu)$ represent arbitrary but different indices. This produces four different terms, identified as the left-hand side of Eq. (1.19). Evaluation of the same construct on $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$ produces zero by the symmetry of the right-hand side.
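
As a small numerical illustration of the field tensor (1.20), the sketch below builds $F_{\mu\nu}$ from given field components, raises both indices with the metric diag(1, −1, −1, −1), and evaluates the contraction $F_{\mu\nu}F^{\mu\nu}$, which should equal $2(\mathbf{B}^2 - \mathbf{E}^2)$. The function names and the sample field values are illustrative assumptions and do not appear in the text.

```python
# Sketch: the field tensor of Eq. (1.20) and the invariant F_{mu nu} F^{mu nu}.

METRIC = [1.0, -1.0, -1.0, -1.0]   # diagonal of the 4-metric g

def field_tensor(E, B):
    """Covariant components F_{mu nu} arranged as in Eq. (1.20)."""
    E1, E2, E3 = E
    B1, B2, B3 = B
    return [
        [0.0,  E1,  E2,  E3],
        [-E1, 0.0, -B3,  B2],
        [-E2,  B3, 0.0, -B1],
        [-E3, -B2,  B1, 0.0],
    ]

def raise_indices(F):
    """F^{mu nu} = g^{mu mu} g^{nu nu} F_{mu nu} for a diagonal metric."""
    return [[METRIC[m] * METRIC[n] * F[m][n] for n in range(4)] for m in range(4)]

def contraction(F):
    """Full contraction F_{mu nu} F^{mu nu}."""
    F_up = raise_indices(F)
    return sum(F[m][n] * F_up[m][n] for m in range(4) for n in range(4))

E = (1.0, 2.0, 3.0)                 # hypothetical sample values
B = (4.0, 5.0, 6.0)
F = field_tensor(E, B)

antisymmetric = all(F[m][n] == -F[n][m] for m in range(4) for n in range(4))
invariant = contraction(F)
expected = 2.0 * (sum(b * b for b in B) - sum(e * e for e in E))
```

With these sample values both `invariant` and `expected` evaluate to 126.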








As to the structure of the action S[A] we can proceed in different ways. One option would be to regard Maxwell’s equations as fundamental, i.e. to construct an action that produces these equations upon variation (by analogy with the situation in classical mechanics where the action functional was designed so as to reproduce Newton’s equations). However, we can also be a little bit more ambitious and ask whether the structure of the action can be motivated independently of Maxwell’s equations. In fact, there is just one principle in electrodynamics as “fundamental” as Maxwell’s equations: symmetry. A theory of electromagnetism must be Lorentz invariant, i.e. invariant under relativistic coordinate transformations.



Hendrik Antoon Lorentz 1853- 
1928 

1902 Nobel Laureate in Physics 
(with Pieter Zeeman) in recog- 
nition of the extraordinary ser- 
vice they rendered by their 
researches into the influence of 
magnetism upon radiation phe- 
nomena. Lorentz derived the transformation equa- 
tions subsequently used by Albert Einstein to 
describe space and time. 




INFO Let us briefly recapitulate the notion of Lorentz invariance. Suppose we are given a 4-vector $X^\mu$. A linear coordinate transformation $X^\mu \to X'^\mu \equiv T^\mu{}_\nu X^\nu$ is a Lorentz transformation if it leaves the 4-metric

$$g = \{g^{\mu\nu}\} = \begin{pmatrix} 1 & & & \\ & -1 & & \\ & & -1 & \\ & & & -1 \end{pmatrix}$$ (1.21)

invariant: $T^T g T = g$. To concisely formulate the invariance properties of relativistic theories, it is common to introduce the notion of raised and lowered indices. Defining $X_\mu \equiv g_{\mu\nu} X^\nu$, Lorentz invariance is expressed as $X^\mu X_\mu = X'^\mu X'_\mu$.
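
The defining condition $T^T g T = g$ can be verified explicitly for a simple case. The sketch below assumes a boost along the x-axis with velocity v (in units c = 1); the helper names and the sample 4-vector are illustrative and not part of the text.

```python
import math

# 4-metric of Eq. (1.21)
G = [[1.0, 0.0, 0.0, 0.0],
     [0.0, -1.0, 0.0, 0.0],
     [0.0, 0.0, -1.0, 0.0],
     [0.0, 0.0, 0.0, -1.0]]

def boost_x(v):
    """Lorentz boost along x with velocity v (|v| < 1, c = 1)."""
    gamma = 1.0 / math.sqrt(1.0 - v * v)
    return [[gamma, -gamma * v, 0.0, 0.0],
            [-gamma * v, gamma, 0.0, 0.0],
            [0.0, 0.0, 1.0, 0.0],
            [0.0, 0.0, 0.0, 1.0]]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def transpose(A):
    return [list(row) for row in zip(*A)]

def interval(X):
    """X^mu X_mu for the diagonal metric G."""
    return sum(G[m][m] * X[m] * X[m] for m in range(4))

T = boost_x(0.6)
TgT = matmul(transpose(T), matmul(G, T))        # should reproduce G

X = [2.0, 1.0, -0.5, 3.0]                       # hypothetical sample 4-vector
X_prime = [sum(T[m][n] * X[n] for n in range(4)) for m in range(4)]
```

Here `TgT` agrees with `G` up to rounding, and `interval(X)` and `interval(X_prime)` coincide, as required of a Lorentz transformation.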

Aided by the symmetry criterion, we can attempt to conjecture the structure of the action 
from three basic assumptions, all independent of Maxwell’s equations: the action should 
be invariant under (a) Lorentz transformations and (b) gauge transformations, and (c) it 
should be simple! The most elementary choice compatible with these conditions is 



$$S[A] = \int d^4x\, \left(c_1 F_{\mu\nu}F^{\mu\nu} + c_2 A_\mu j^\mu\right),$$ (1.22)

where $d^4x = \prod_\mu dx_\mu = dt\, dx_1\, dx_2\, dx_3$ denotes the measure, $j_\mu = (\rho, -\mathbf{j})$ the 4-current, and $c_{1,2}$ are undetermined constants. Up to quadratic order in A, Eq. (1.22) in fact defines the only possible structure consistent with gauge and Lorentz invariance.

EXERCISE Using the continuity equation $\partial_\mu j^\mu = 0$, verify that the Aj-coupling is gauge invariant. (Hint: Integrate by parts.) Verify that a contribution like $\int A_\mu A^\mu$ would not be gauge invariant.








Having defined a trial action, we can apply the variational principle Eq. (1.17) to compute equations of motion. In the present context, the role of the field $\phi$ is taken by the four components of A. Variation of the action with respect to $A_\mu$ gives four equations of motion,

$$\frac{\partial\mathcal{L}}{\partial A_\mu} - \partial_\nu\frac{\partial\mathcal{L}}{\partial(\partial_\nu A_\mu)} = 0, \qquad \mu = 0, \ldots, 3,$$ (1.23)

where the Lagrangian density is defined by $S = \int d^4x\, \mathcal{L}$. With the specific form of $\mathcal{L}$, it is straightforward to verify that $\partial\mathcal{L}/\partial A_\mu = c_2 j^\mu$ and $\partial\mathcal{L}/\partial(\partial_\nu A_\mu) = -4c_1 F^{\mu\nu}$. Substitution of these building blocks into the equations of motion finally yields $4c_1\,\partial_\nu F^{\nu\mu} = c_2 j^\mu$. Comparing this with the definition of the field tensor (1.20), and setting $c_1/c_2 = 1/4$, we arrive at Maxwell’s equations (1.18). We finally fix the overall multiplicative constant $c_1$ ($= c_2/4$) by requiring that the Hamiltonian density associated with the Lagrangian density $\mathcal{L}$ reproduce the known energy density of the EM field (see Problem 1.8). This leads to $c_1 = -1/4$, so that we have identified



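$$\mathcal{L}(A_\mu, \partial_\nu A_\mu) = -\frac{1}{4} F_{\mu\nu}F^{\mu\nu} - A_\mu j^\mu,$$ (1.24)

as the Lagrangian density of the electromagnetic field (here $c_2 = 4c_1 = -1$). The corresponding action is given by $S[A] = \int d^4x\, \mathcal{L}(A_\mu, \partial_\nu A_\mu)$.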

At first sight, this result does not look particularly surprising. After all, Maxwell’s equa- 
tions can be found on the first page of most textbooks on electrodynamics. However, further 
reflection will show that our achievement is actually quite remarkable. By invoking only 
symmetry, the algebraic structure of Maxwell’s equations has been established unambigu- 
ously. We have thus proven that Maxwell’s equations are relativistically invariant, a fact 
not obvious from the equations themselves. Further, we have shown that Eq. (1.18) are the 
only equations of motion linear in the current-density distribution and consistent with the 
invariance principle. One might object that, in addition to symmetry, we have also imposed 
an ad hoc “simplicity” criterion on the action S[A]. However, later we will see that this was
motivated by more than mere aesthetic principles. 

Finally, we note that the symmetry-oriented modeling that led to Eq. (1.22) is illustrative 
of a popular construction scheme in modern field theory. The symmetry-oriented approach 
stands complementary to the “microscopic” formulation exemplified in Section 1.1. Crudely 
speaking, these are the two principal approaches to constructing effective low-energy field 
theories: 



> Microscopic analysis: Starting from a microscopically defined system, one projects 
onto those degrees of freedom that one believes are relevant for the low-energy dynam- 
ics. Ideally, this “belief” is backed up by a small expansion parameter stabilizing the 
mathematical parts of the analysis. Advantages: The method is rigorous and fixes the 
resulting field theory completely. Disadvantages: The method is time-consuming and, for 
sufficiently complex systems, not even viable. 

> Symmetry considerations: One infers an effective low-energy theory on the basis of 
only fundamental symmetries of the physical system. Advantages: The method is fast and 








elegant. Disadvantages: It is less explicit than the microscopic approach. Most impor- 
tantly, it does not fix the coefficients of the different contributions to the action. 

Thus far, we have introduced some basic concepts of field theoretical modeling in condensed 
matter physics. Starting from a microscopic model Hamiltonian, we have illustrated how 
principles of universality and symmetry can be applied to distill effective continuum field 
theories capturing the low-energy content of the system. We have formulated such theories in 
the language of Lagrangian and Hamiltonian continuum mechanics, respectively, and shown 
how variational principles can be applied to extract concrete physical information. Finally, 
we have seen that field theory provides a unifying framework whereby analogies between 
seemingly different physical systems can be uncovered. In the next section we discuss how 
the formalism of classical field theory can be elevated to the quantum level. 



1.4 Quantum chain 



Earlier we saw that, at low temperatures, the excitation profile of the classical atomic chain 
differs drastically from that observed in experiment. Generally, in condensed matter physics, 
low-energy phenomena with pronounced temperature sensitivity are indicative of a quantum 
mechanism at work. To introduce and exemplify a general procedure whereby quantum 
mechanics can be incorporated into continuum models, we next consider the low-energy 
physics of the quantum mechanical atomic chain. 

The first question to ask is conceptual: how can a model like Eq. (1.4) be quantized in general? Indeed, there exists a standard procedure for quantizing continuum theories, which closely resembles the quantization of Hamiltonian point mechanics. Consider the defining Eq. (1.9) and (1.10) for the canonical momentum and the Hamiltonian, respectively. Classically, the momentum $\pi(x)$ and the coordinate $\phi(x)$ are canonically conjugate variables: $\{\pi(x), \phi(x')\} = -\delta(x - x')$, where $\{\,,\}$ is the Poisson bracket and the δ-function arises through continuum generalization of the discrete identity $\{P_I, R_{I'}\} = -\delta_{II'}$, $I, I' = 1, \ldots, N$. The theory is quantized by generalization of the canonical quantization procedure for the discrete pair of conjugate coordinates $(R_I, P_I)$ to the continuum: (i) promote $\phi(x)$ and $\pi(x)$ to operators, $\phi \mapsto \hat\phi$, $\pi \mapsto \hat\pi$, and (ii) generalize the canonical commutation relation $[\hat P_I, \hat R_{I'}] = -i\hbar\,\delta_{II'}$ to$^{12}$

$$[\hat\pi(x), \hat\phi(x')] = -i\hbar\,\delta(x - x').$$ (1.25)



Operator-valued functions like $\hat\phi$ and $\hat\pi$ are generally referred to as quantum fields. For
clarity, the relevant relations between canonically conjugate classical and quantum fields 
are summarized in Table 1.2. 



INFO By introducing quantum fields, we have departed from the conceptual framework laid 
out on page 6: being operator-valued, the quantized field no longer represents a mapping into an 



12 Note that the dimensionality of both the quantum and the classical continuum field is compatible with the dimensionality of the Dirac δ-function, $[\delta(x - x')] = [\text{length}]^{-1}$, i.e. $[\phi(x)] = [\phi_I] \cdot [\text{length}]^{-1/2}$ and similarly for $\pi$.








Table 1.2 Relations between discrete and continuum canonically conjugate variables/operators.

             Classical                                      Quantum
Discrete     $\{P_I, R_{I'}\} = -\delta_{II'}$              $[\hat P_I, \hat R_{I'}] = -i\hbar\,\delta_{II'}$
Continuum    $\{\pi(x), \phi(x')\} = -\delta(x - x')$       $[\hat\pi(x), \hat\phi(x')] = -i\hbar\,\delta(x - x')$



ordinary differentiable manifold. 13 It is thus legitimate to ask why we bothered to give a lengthy 
exposition of fields as “ordinary” functions. The reason is that, in the not too distant future, 
after the framework of functional field integration has been introduced, we will return to the 
comfortable ground of the definition of page 6. 



Employing these definitions, the classical Hamiltonian density (1.10) becomes the quantum 
operator 

$$\hat{\mathcal{H}}(\hat\phi, \hat\pi) = \frac{1}{2m}\hat\pi^2 + \frac{k_s a^2}{2}(\partial_x\hat\phi)^2.$$ (1.26)

The Hamiltonian above represents a quantum field theoretical formulation of the problem 
but not yet a solution. In fact, the development of a spectrum of methods for the analysis of 
quantum field theoretical models will represent a major part of this text. At this point the 
objective is merely to exemplify the way physical information can be extracted from models 
like Eq. (1.26). As a word of caution, let us mention that the following manipulations, while 
mathematically straightforward, are conceptually deep. To disentangle different aspects of 
the problem, we will first concentrate on plain operational aspects. Later, in Section 1.4, 
we will reflect on “what has really happened.” 

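As with any function, operator-valued functions can be represented in a variety of different ways. In particular, they can be subjected to Fourier transformation,

$$\begin{pmatrix}\hat\phi_k \\ \hat\pi_k\end{pmatrix} \equiv \frac{1}{L^{1/2}}\int_0^L dx\, e^{\mp ikx}\begin{pmatrix}\hat\phi(x) \\ \hat\pi(x)\end{pmatrix}, \qquad \begin{pmatrix}\hat\phi(x) \\ \hat\pi(x)\end{pmatrix} = \frac{1}{L^{1/2}}\sum_k e^{\pm ikx}\begin{pmatrix}\hat\phi_k \\ \hat\pi_k\end{pmatrix},$$ (1.27)

where the upper (lower) sign refers to $\hat\phi$ ($\hat\pi$) and $\sum_k$ represents the sum over all Fourier coefficients indexed by quantized momenta $k = 2\pi m/L$, $m \in \mathbb{Z}$ (not to be confused with the “operator momentum” $\hat\pi$!). Note that the real classical field $\phi(x)$ quantizes to a Hermitian quantum field $\hat\phi(x)$, implying that $\hat\phi_k = \hat\phi_{-k}^\dagger$ (and similarly for $\hat\pi_k$). The corresponding Fourier representation of the canonical commutation relations reads (exercise)

$$[\hat\pi_k, \hat\phi_{k'}] = -i\hbar\,\delta_{kk'}.$$ (1.28)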



13 At least if we ignore the mathematical subtlety that a linear operator can also be interpreted as an element of a certain manifold.







When expressed in the Fourier representation, making use of the identity

$$\int dx\, (\partial_x\hat\phi)^2 = \sum_{k,k'} (ik\hat\phi_k)(ik'\hat\phi_{k'})\, \underbrace{\frac{1}{L}\int dx\, e^{i(k+k')x}}_{\delta_{k+k',0}} = \sum_k k^2\, \hat\phi_k\hat\phi_{-k} = \sum_k k^2\, |\hat\phi_k|^2,$$

together with a similar relation for $\int dx\, \hat\pi^2$, the Hamiltonian $\hat H = \int dx\, \hat{\mathcal{H}}(\hat\phi, \hat\pi)$ assumes the near diagonal form

$$\hat H = \sum_k \left[\frac{1}{2m}\hat\pi_k\hat\pi_{-k} + \frac{m\omega_k^2}{2}\hat\phi_k\hat\phi_{-k}\right],$$ (1.29)

where $\omega_k = v|k|$ and $v = a\sqrt{k_s/m}$ denotes the classical sound wave velocity. In this form, the Hamiltonian can be identified as nothing but a superposition of independent harmonic oscillators.$^{14}$ This result is actually not difficult to understand (see figure): Classically, the system supports a discrete set of wave excitations, each indexed by a wave number $k = 2\pi m/L$. (In fact, we could have performed a Fourier transformation of the classical fields $\phi(x)$ and $\pi(x)$ to represent the Hamiltonian function as a superposition of classical harmonic oscillators.) Within the quantum picture, each of these excitations is described by an oscillator Hamiltonian operator with a k-dependent frequency. However, it is important not to confuse the atomic constituents, also oscillators (albeit coupled), with the independent collective oscillator modes described by $\hat H$.
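
The Fourier identity used above can be checked for a classical real field, for which $\phi_{-k} = \phi_k^*$. The sketch below assumes a ring of length L and a field built from two hypothetical Fourier amplitudes; it compares the real-space integral $\int dx\,(\partial_x\phi)^2$ with the mode sum $\sum_k k^2|\phi_k|^2$. All names and numerical values are illustrative.

```python
import cmath
import math

L = 2.0                                         # ring circumference (arbitrary units)
amplitudes = {1: 0.3 + 0.4j, 2: -0.5 + 0.1j}    # hypothetical phi_k for m = 1, 2

# Reality of phi(x) requires phi_{-k} = conj(phi_k).
coeffs = {}
for m, c in amplitudes.items():
    coeffs[m] = c
    coeffs[-m] = c.conjugate()

def k_of(m):
    """Quantized momentum k = 2 pi m / L."""
    return 2.0 * math.pi * m / L

def dphi(x):
    """Derivative of phi(x) = L^{-1/2} sum_k exp(i k x) phi_k."""
    total = sum(1j * k_of(m) * c * cmath.exp(1j * k_of(m) * x)
                for m, c in coeffs.items())
    return total / math.sqrt(L)

# A Riemann sum is exact for a trigonometric polynomial of this low degree.
N = 64
dx = L / N
real_space = sum(dphi(n * dx).real ** 2 for n in range(N)) * dx
mode_sum = sum(k_of(m) ** 2 * abs(c) ** 2 for m, c in coeffs.items())
max_imag = max(abs(dphi(n * dx).imag) for n in range(N))
```

The quantities `real_space` and `mode_sum` agree to rounding accuracy, and `max_imag` confirms that the reconstructed field derivative is real.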

The description above, albeit perfectly valid, still suffers from a deficiency: our analy- 
sis amounts to explicitly describing the effective low-energy excitations of the system (the 
waves) in terms of their microscopic constituents (the atoms). Indeed the different contributions to $\hat H$ keep track of details of the microscopic oscillator dynamics of individual k-modes. However, it would be much more desirable to develop a picture where the relevant
excitations of the system, the waves, appear as fundamental units, without explicit account 
of underlying microscopic details. (As with hydrodynamics, information is encoded in terms 
of collective density variables rather than through individual molecules.) As preparation for 
the construction of this improved formulation of the system, let us temporarily focus on a 
single oscillator mode. 




Revision of the quantum harmonic oscillator 

Consider a standard harmonic oscillator (HO) Hamiltonian

$$\hat H = \frac{\hat p^2}{2m} + \frac{m\omega^2}{2}\hat x^2.$$




14 The only difference between Eq. (1.29) and the canonical form of an oscillator Hamiltonian $\hat H = \hat p^2/(2m) + m\omega^2\hat x^2/2$ is the presence of the sub-indices k and −k (a consequence of $\hat\phi_k^\dagger = \hat\phi_{-k}$). As we will show shortly, this difference is inessential.





Figure 1.6 Low-lying energy levels/states of the harmonic oscillator. 

The first few energy levels $\epsilon_n = \omega\left(n + \frac{1}{2}\right)$ and the associated Hermite polynomial eigenfunctions are displayed schematically in Fig. 1.6. (To simplify the notation we henceforth set $\hbar = 1$.)

The HO has, of course, the status of a single-particle problem. However, the equidistance of its energy levels suggests an alternative interpretation. One can think of a given energy state $\epsilon_n$ as an accumulation of n elementary entities, or quasi-particles, each having energy ω. What can be said about the features of these new objects? First, they are structureless, i.e. the only “quantum number” identifying the quasi-particles is their energy ω (otherwise n-particle states formed of the quasi-particles would not be equidistant). This implies that the quasi-particles must be bosons. (The same state ω can be occupied by more than one particle, see Fig. 1.6.)

This idea can be formulated in quantitative terms by employing the formalism of ladder operators in which the operators $\hat p$ and $\hat x$ are traded for the pair of Hermitian adjoint operators $\hat a \equiv \sqrt{\frac{m\omega}{2}}\left(\hat x + \frac{i}{m\omega}\hat p\right)$, $\hat a^\dagger \equiv \sqrt{\frac{m\omega}{2}}\left(\hat x - \frac{i}{m\omega}\hat p\right)$. Up to a factor of i, the transformation $(\hat x, \hat p) \to (\hat a, \hat a^\dagger)$ is canonical, i.e. the new operators obey the canonical commutation relation

$$[\hat a, \hat a^\dagger] = 1.$$ (1.30)

More importantly, the a-representation of the Hamiltonian is very simple, namely

$$\hat H = \omega\left(\hat a^\dagger\hat a + \frac{1}{2}\right),$$ (1.31)

as can be checked by direct substitution. Suppose, now, we had been given a zero eigenvalue state $|0\rangle$ of the operator $\hat a$: $\hat a|0\rangle = 0$. As a direct consequence, $\hat H|0\rangle = (\omega/2)|0\rangle$, i.e. $|0\rangle$ is identified as the ground state of the oscillator.$^{15}$ The complete hierarchy of higher energy states can now be generated by setting $|n\rangle \equiv (n!)^{-1/2}(\hat a^\dagger)^n|0\rangle$.

EXERCISE Using the canonical commutation relation, verify that $\hat H|n\rangle = \omega(n + 1/2)|n\rangle$ and $\langle n|n\rangle = 1$.
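
The ladder-operator algebra can be made concrete with finite matrices. The sketch below is an illustration under stated assumptions, not a construction from the text: the Fock space is truncated to the first DIM number states, $\hat a$ is represented by the matrix $\langle m|\hat a|n\rangle = \sqrt{n}\,\delta_{m,n-1}$, and units with ħ = 1 are used. The truncation necessarily spoils the commutator in the highest retained state, which the last matrix element makes visible.

```python
import math

DIM = 6        # number of retained Fock states (truncation, an assumption)
OMEGA = 1.0    # oscillator frequency, hbar = 1

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(A):
    return [list(row) for row in zip(*A)]

# Annihilation operator: <m|a|n> = sqrt(n) * delta_{m, n-1}
a = [[math.sqrt(n) if m == n - 1 else 0.0 for n in range(DIM)]
     for m in range(DIM)]
a_dag = transpose(a)                 # real matrix, so adjoint = transpose

number = matmul(a_dag, a)            # occupation number operator a^dagger a
hamiltonian = [[OMEGA * (number[m][n] + (0.5 if m == n else 0.0))
                for n in range(DIM)] for m in range(DIM)]

a_a_dag = matmul(a, a_dag)
commutator = [[a_a_dag[m][n] - number[m][n] for n in range(DIM)]
              for m in range(DIM)]

energies = [hamiltonian[n][n] for n in range(DIM)]
```

The diagonal of `hamiltonian` reproduces ω(n + 1/2), and `commutator` is the unit matrix except for the last diagonal entry, which equals −(DIM − 1) as an artifact of the truncation.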

Formally, the construction above represents yet another way of constructing eigenstates of 
the quantum HO. However, its “real” advantage is that it naturally affords a many-particle 
interpretation. To this end, let us declare $|0\rangle$ to represent a “vacuum” state, i.e. a state with zero particles present. Next, imagine that $\hat a^\dagger|0\rangle$ is a state with a single featureless particle




15 This can be verified by explicit construction. Switching to a real-space representation, the solution of the equation $[x + \partial_x/(m\omega)]\langle x|0\rangle = 0$ obtains the familiar ground state wavefunction $\langle x|0\rangle = (m\omega/\pi)^{1/4}\, e^{-m\omega x^2/2}$.







Figure 1.7 Diagram visualizing an excited state of the chain. Here, the number of quasi-particles decreases with increasing energy $\omega_k$.



(the operator $\hat a^\dagger$ does not carry any quantum number labels) of energy ω. Similarly, $(\hat a^\dagger)^n|0\rangle$ is considered as a many-body state with n particles, i.e. within the new picture, $\hat a^\dagger$ is an operator that creates particles. The total energy of these states is given by ω × (occupation number). Indeed, it is straightforward to verify (see exercise above) that $\hat a^\dagger\hat a|n\rangle = n|n\rangle$, i.e. the Hamiltonian basically counts the number of particles. While, at first sight, this may
look unfamiliar, the new interpretation is internally consistent. Moreover, it achieves what 
we had asked for above, i.e. it allows an interpretation of the HO states as a superposition 
of independent structureless entities. 

INFO The representation above illustrates the capacity to think about individual quantum 
problems in complementary pictures. This principle finds innumerable applications in modern 
condensed matter physics. The existence of different interpretations of a given system is by no 
means heretical but, rather, reflects a principle of quantum mechanics: there is no “absolute” 
system that underpins the phenomenology. The only thing that matters is observable phenomena. 
For example, we will see later that the “fictitious” quasi-particle states of oscillator systems 
behave as “real” particles, i.e. they have dynamics, can interact, be detected experimentally, etc. 
From a quantum point of view these objects are, then, real particles.



Quasi-particle interpretation of the quantum chain 

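Returning to the oscillator chain, one can transform the Hamiltonian (1.29) to a form analogous to (1.31) by defining the ladder operators$^{16}$

$$\hat a_k \equiv \sqrt{\frac{m\omega_k}{2}}\left(\hat\phi_k + \frac{i}{m\omega_k}\hat\pi_{-k}\right), \qquad \hat a_k^\dagger \equiv \sqrt{\frac{m\omega_k}{2}}\left(\hat\phi_{-k} - \frac{i}{m\omega_k}\hat\pi_k\right).$$ (1.32)

With this definition, applying the commutation relations Eq. (1.28), one finds that the ladder operators obey commutation relations generalizing Eq. (1.30):

$$[\hat a_k, \hat a_{k'}^\dagger] = \delta_{kk'}, \qquad [\hat a_k, \hat a_{k'}] = [\hat a_k^\dagger, \hat a_{k'}^\dagger] = 0.$$ (1.33)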



16 As to the consistency of these definitions, recall that $\hat\phi_k^\dagger = \hat\phi_{-k}$ and $\hat\pi_k^\dagger = \hat\pi_{-k}$. Under these conditions the second of the definitions in the text follows from the first upon taking the Hermitian adjoint.








Figure 1.8 Phonon spectra of the transition metal oxide Sr$_2$RuO$_4$ measured along different axes in momentum space. Notice the approximate linearity of the low-energy branches (acoustic phonons) at small momenta. Superimposed at high frequencies are various branches of optical phonons. (Source: Courtesy of M. Braden, II. Physikalisches Institut, Universität zu Köln.)



Expressing the operators $(\hat\phi_k, \hat\pi_k)$ in terms of $(\hat a_k, \hat a_k^\dagger)$, it is now straightforward to bring the Hamiltonian into the quasi-particle oscillator form (exercise)

$$\hat H = \sum_k \omega_k\left(\hat a_k^\dagger\hat a_k + \frac{1}{2}\right).$$ (1.34)

Equations (1.34) and (1.33) represent the final result of our analysis. The Hamiltonian $\hat H$ takes the form of a sum of harmonic oscillators with characteristic frequencies $\omega_k$. In the limit $k \to 0$ (i.e. long wavelength), one finds $\omega_k \to 0$; excitations with this property are said to be massless.

An excited state of the system is indexed by a set $\{n_k\} = (n_1, n_2, \ldots)$ of quasi-particles with energy $\{\omega_k\}$ (see Fig. 1.7). Physically, the quasi-particles of the harmonic chain are identified with the phonon modes of the solid. A comparison with measured phonon spectra (Fig. 1.8) reveals that, at low momenta, $\omega_k \sim |k|$ in agreement with our simplistic model (even in spite of the fact that the spectrum was recorded for a three-dimensional solid with non-trivial unit cell: universality!). While the linear dispersion was already a feature of the classical sound wave spectrum, the low-temperature specific heat reflected non-classical behavior. It is left as an exercise (problem 1.8) to verify that the quantum nature of the phonons resolves the problem with the low-temperature specific heat discussed in Section 1.1. (For further discussion of phonon modes in atomic lattices we refer to Chapter 2 of the text by Kittel.$^{17}$)
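
The effect of quantization on the specific heat can be previewed numerically. The sketch below is not the solution of the exercise; it assumes independent oscillator modes with $\omega_k = v|k|$, thermal Bose occupation of each mode, a hypothetical cutoff `m_max` on the mode index, and units with $\hbar = k_B = 1$. The heat capacity per mode is then $x^2 e^{-x}/(1 - e^{-x})^2$ with $x = \omega_k/T$.

```python
import math

def mode_heat_capacity(omega, temperature):
    """Heat capacity of one oscillator mode (hbar = k_B = 1).

    Derivative with respect to T of omega * (n_B + 1/2), where
    n_B = 1 / (exp(omega / T) - 1) is the Bose occupation.
    """
    x = omega / temperature
    one_minus_exp = -math.expm1(-x)            # 1 - exp(-x), accurate for small x
    return x * x * math.exp(-x) / one_minus_exp ** 2

def chain_heat_capacity(temperature, v=1.0, length=2.0 * math.pi, m_max=10):
    """Sum over modes k = 2 pi m / L with m = ±1, ..., ±m_max."""
    total = 0.0
    for m in range(1, m_max + 1):
        omega = v * 2.0 * math.pi * m / length
        total += 2.0 * mode_heat_capacity(omega, temperature)   # +k and -k
    return total

high_T = chain_heat_capacity(1000.0)   # approaches the number of modes (20)
low_T = chain_heat_capacity(0.05)      # exponentially small once T << lowest omega
```

At high temperature the result approaches one unit per mode, the classical value, whereas at temperatures below the lowest mode energy the contribution of every mode is frozen out.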



1.5 Quantum electrodynamics 

The generality of the procedure outlined above suggests that the quantization of the EM 
field Eq. (1.24) proceeds in a manner analogous to the phonon system. However, there are 
a number of practical differences that make quantization of the EM field a harder (but 
also more interesting!) enterprise. Firstly, the vectorial character of the vector potential, in 
combination with the condition of relativistic covariance, gives the problem a non-trivial 
internal geometry. Closely related, the gauge freedom of the vector potential introduces 
redundant degrees of freedom whose removal on the quantum level is not easily achieved. For 



17 C. Kittel, Quantum Theory of Solids, 2nd edition (Wiley, 1987).








example, quantization in a setting where only physical degrees of freedom are kept - i.e. the 
two polarization directions of the transverse photon field - is technically cumbersome, the 
reason being that the relevant gauge condition is not relativistically covariant. In contrast, a 
manifestly covariant scheme, while technically more convenient, introduces spurious “ghost 
degrees of freedom” that are difficult to remove. To circumvent a discussion of these issues, 
we will not discuss the problem of EM field quantization in full detail. 18 On the other hand, 
the photon field plays a much too important role in condensed matter physics for us to 
drop the problem altogether. We will therefore aim at an intermediate exposition, largely 
insensitive to the problems outlined above but sufficiently general to illustrate the main 
principles. 



Field quantization 

Consider the Lagrangian of the matter-free EM field, $L = -\frac{1}{4}\int d^3x\, F_{\mu\nu}F^{\mu\nu}$. As a first step towards quantization of this system we fix a gauge. In the absence of charge, a particularly convenient choice is the Coulomb gauge, $\nabla \cdot \mathbf{A} = 0$, with the scalar component $\phi = 0$. (Keep in mind that, once a gauge has been set, we cannot expect further results to display “gauge invariance.”) Using the gauge conditions, one may verify that the Lagrangian assumes the form

$$L = \frac{1}{2}\int d^3x\, \left[(\partial_t\mathbf{A})^2 - (\nabla \times \mathbf{A})^2\right].$$ (1.35)

By analogy with our discussion of the atomic chain, we would now proceed to “decouple” 
the theory by expanding the action in terms of eigenfunctions of the Laplace operator. 
The difference with our previous discussion is that we are dealing (i) with the full three- 
dimensional Laplacian (instead of a simple second derivative) acting on (ii) the vectorial 
quantity A that is (iii) subject to the constraint V • A = 0. It is these aspects which lead 
to the complications outlined above. 

We can circumvent these difficulties by considering cases where the geometry of the system 
reduces the complexity of the eigenvalue problem. This restriction is less artificial than it 
might appear. For example, in anisotropic electromagnetic waveguides, the solutions of the 
eigenvalue equation can be formulated as 19 

$$-\nabla^2\mathbf{R}_k(\mathbf{x}) = \lambda_k\mathbf{R}_k(\mathbf{x}),$$ (1.36)

where $k \in \mathbb{R}$ is a one-dimensional index and the vector-valued functions $\mathbf{R}_k$ are real and orthonormalized, $\int \mathbf{R}_k \cdot \mathbf{R}_{k'} = \delta_{kk'}$. The dependence of the eigenvalues $\lambda_k$ on k depends on details of the geometry (see Eq. (1.38) below) and need not be specified for the moment.

INFO An electrodynamic waveguide is a quasi-one-dimensional cavity with metallic boundaries (see Fig. 1.9). The practical advantage of waveguides is that they are good at confining EM



18 Readers interested in learning more about EM field quantization are referred to, e.g., L. H. Ryder, Quantum 
Field Theory (Cambridge University Press, 1996). 

19 More precisely, one should say that Eq. (1.36) defines the set of eigenfunctions relevant for the low-energy 
dynamics of the waveguide. More-complex eigenfunctions of the Laplace operator exist but they carry much 
higher energy. 








Figure 1.9 EM waveguide with rectangular cross-section. The structure of the eigenmodes of the 
EM field is determined by boundary conditions at the walls of the cavity. 



waves. At large frequencies, where the wavelengths are of order meters or less, radiation loss in 
conventional conductors is high. In these frequency domains, hollow conductors provide the only 
practical way of transmitting radiation. 

EM field propagation inside a waveguide is constrained by boundary conditions. Assuming 
the walls of the system to be perfectly conducting, 

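$$\mathbf{E}_\parallel(\mathbf{x}_b) = 0, \qquad \mathbf{B}_\perp(\mathbf{x}_b) = 0,$$ (1.37)

where $\mathbf{x}_b$ is a point at the system boundary and $\mathbf{E}_\parallel$ ($\mathbf{B}_\perp$) is the parallel (perpendicular) component of the electric (magnetic) field.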

For concreteness, returning to the problem of field quantization, let us consider a cavity with
uniform rectangular cross-section $L_y \times L_z$. To conveniently represent the Lagrangian of the
system, we wish to express the vector potential in terms of eigenfunctions $\mathbf{R}_{\mathbf{k}}$ that are consistent
with the boundary conditions (1.37). A complete set of functions fulfilling this condition is given
by

$$\mathbf{R}_{\mathbf{k}} = \mathcal{N}_{\mathbf{k}} \begin{pmatrix} c_1 \cos(k_x x)\sin(k_y y)\sin(k_z z) \\ c_2 \sin(k_x x)\cos(k_y y)\sin(k_z z) \\ c_3 \sin(k_x x)\sin(k_y y)\cos(k_z z) \end{pmatrix}.$$

Here, $k_i = n_i\pi/L_i$, $n_i \in \mathbb{N}$, $i = x, y, z$, $\mathcal{N}_{\mathbf{k}}$ is a factor normalizing $\mathbf{R}_{\mathbf{k}}$ to unit modulus, and the
coefficients $c_i$ are subject to the condition $c_1 k_x + c_2 k_y + c_3 k_z = 0$. Indeed, it is straightforward
to verify that a general superposition of the type $\mathbf{A}(\mathbf{x}, t) \equiv \sum_{\mathbf{k}} \alpha_{\mathbf{k}}(t)\mathbf{R}_{\mathbf{k}}(\mathbf{x})$, $\alpha_{\mathbf{k}}(t) \in \mathbb{R}$, is
divergenceless, and generates an EM field compatible with (1.37). Substitution of $\mathbf{R}_{\mathbf{k}}$ into Eq. (1.36)
identifies the eigenvalues as $\lambda_{\mathbf{k}} = k_x^2 + k_y^2 + k_z^2$. In the physics and electronic engineering
literature, eigenfunctions of the Laplace operator in a quasi-one-dimensional geometry are commonly
described as modes. As we will see shortly, the energy of a mode (i.e. the Hamiltonian evaluated
on a specific mode configuration) grows with $|\lambda_{\mathbf{k}}|$. In cases where one is interested in the
low-energy dynamics of the EM field, only configurations with small $|\lambda_{\mathbf{k}}|$ are relevant. For example,
let us consider a massively anisotropic waveguide with $L_z < L_y \ll L_x$. In this case the modes
with smallest $|\lambda_{\mathbf{k}}|$ are those with $k_z = 0$, $k_y = \pi/L_y$, and $k_x \equiv k \ll L_{z,y}^{-1}$. (Why is it not possible
to set both $k_y$ and $k_z$ to zero?) With this choice,

$$\lambda_k = k^2 + \left(\frac{\pi}{L_y}\right)^2,$$ (1.38)

and a scalar index $k$ suffices to label both eigenvalues and eigenfunctions $\mathbf{R}_k$. A caricature of
the spatial structure of the functions $\mathbf{R}_k$ is shown in Fig. 1.9. The dynamical properties of these
configurations will be discussed in the text.
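The properties claimed for the cavity modes can be checked numerically. The following is a minimal sketch, not part of the original text: the cavity dimensions, the mode numbers, and the function names (`mode`, `divergence`, `eigen_residual`) are illustrative assumptions, the normalization factor is dropped, and derivatives are estimated by finite differences.

```python
import math

# Illustrative (hypothetical) cavity dimensions and mode numbers.
L_x, L_y, L_z = 10.0, 2.0, 1.0
n_x, n_y, n_z = 3, 1, 2
k = (n_x * math.pi / L_x, n_y * math.pi / L_y, n_z * math.pi / L_z)

# Choose c_1 and c_2 freely and fix c_3 from c_1 k_x + c_2 k_y + c_3 k_z = 0.
c = (1.0, 1.0, -(k[0] + k[1]) / k[2])
lam = sum(ki * ki for ki in k)  # expected eigenvalue k_x^2 + k_y^2 + k_z^2


def mode(p):
    """Unnormalized mode function R_k at the point p = (x, y, z)."""
    s = [math.sin(ki * xi) for ki, xi in zip(k, p)]
    co = [math.cos(ki * xi) for ki, xi in zip(k, p)]
    return (c[0] * co[0] * s[1] * s[2],
            c[1] * s[0] * co[1] * s[2],
            c[2] * s[0] * s[1] * co[2])


def shifted(p, axis, h):
    """Copy of p displaced by h along the given axis."""
    q = list(p)
    q[axis] += h
    return tuple(q)


def divergence(p, h=1e-4):
    """Central-difference estimate of div R_k at p."""
    return sum((mode(shifted(p, a, h))[a] - mode(shifted(p, a, -h))[a]) / (2 * h)
               for a in range(3))


def eigen_residual(p, h=1e-4):
    """Largest component of (-laplacian R_k - lam * R_k) at p."""
    r0 = mode(p)
    worst = 0.0
    for comp in range(3):
        lap = sum((mode(shifted(p, a, h))[comp] - 2 * r0[comp]
                   + mode(shifted(p, a, -h))[comp]) / h**2 for a in range(3))
        worst = max(worst, abs(-lap - lam * r0[comp]))
    return worst


point = (1.3, 0.7, 0.4)
wall = mode((1.3, 0.0, 0.4))  # a point on the wall y = 0
print("div R at sample point      :", divergence(point))
print("eigenvalue residual        :", eigen_residual(point))
print("tangential R on wall y = 0 :", wall[0], wall[2])
```

Both the divergence and the eigenvalue residual should come out at the level of the finite-difference error, and the tangential components vanish on the wall because each carries a factor $\sin(k_y y)$.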





1.5 Quantum electrodynamics 






Returning to the problem posed by Eqs. (1.35) and (1.36), one can expand the vector potential
in terms of eigenfunctions $\mathbf{R}_k$ as $\mathbf{A}(\mathbf{x}, t) = \sum_k \alpha_k(t)\mathbf{R}_k(\mathbf{x})$, where the sum runs over
all allowed values of the index parameter $k$. (In a waveguide, $k = \pi n/L$, $n \in \mathbb{N}$, where
$L$ is the length of the guide.) Substituting this expansion into Eq. (1.35) and using the
normalization properties of $\mathbf{R}_k$, we obtain $L = \frac{1}{2}\sum_k (\dot\alpha_k^2 - \lambda_k \alpha_k^2)$, i.e. a decoupled
representation where the system is described in terms of independent dynamical systems with
coordinates $\alpha_k$. From this point on, quantization proceeds along the lines of the standard
algorithm. Firstly, define momenta through the relation $\pi_k = \partial_{\dot\alpha_k} L = \dot\alpha_k$. This produces
the Hamiltonian $H = \frac{1}{2}\sum_k (\pi_k \pi_k + \lambda_k \alpha_k \alpha_k)$. Next quantize the theory by promoting fields
to operators $\alpha_k \to \hat\alpha_k$ and $\pi_k \to \hat\pi_k$, and declaring $[\hat\pi_k, \hat\alpha_{k'}] = -i\delta_{kk'}$. The quantum
Hamiltonian operator, again of harmonic oscillator type, then reads

$$\hat H = \frac{1}{2}\sum_k \left(\hat\pi_k \hat\pi_k + \omega_k^2 \hat\alpha_k \hat\alpha_k\right),$$

where $\omega_k^2 = \lambda_k$. Following the same logic as marshaled in Section 1.4, we then define ladder
operators $a_k \equiv \sqrt{\omega_k/2}\,(\hat\alpha_k + \frac{i}{\omega_k}\hat\pi_k)$, $a_k^\dagger \equiv \sqrt{\omega_k/2}\,(\hat\alpha_k - \frac{i}{\omega_k}\hat\pi_k)$, whereupon the Hamiltonian
assumes the now familiar form

$$\hat H = \sum_k \omega_k \left(a_k^\dagger a_k + \frac{1}{2}\right).$$ (1.39)

For the specific problem of the first excited mode in a waveguide of width $L_y$,
$\omega_k = [k^2 + (\pi/L_y)^2]^{1/2}$. Equation (1.39) represents our final result for the quantum Hamiltonian
of the EM waveguide. Before concluding this section let us make a few comments on the
structure of the result:

> Firstly, notice that the construction above almost completely paralleled our previous
discussion of the harmonic chain. 20 The structural similarity between the two systems finds
its origin in the fact that the free field Lagrangian (1.35) is quadratic and, therefore, bound
to map onto an oscillator-type Hamiltonian. That we obtained a simple one-dimensional
superposition of oscillators is due to the boundary conditions specific to a narrow waveguide.
For less restrictive geometries, e.g. free space, a more complex superposition of
vectorial degrees of freedom in three-dimensional space would have been obtained. However,
the principal mapping of the free EM field onto a superposition of oscillators is
independent of geometry.

> Physically, the quantum excitations described by Eq. (1.39) are, of course, the photons of
the EM field. The unfamiliar appearance of the dispersion relation $\omega_k$ is again a peculiarity
of the waveguide. However, in the limit of large longitudinal wave numbers $k \gg L_y^{-1}$, the
dispersion approaches $\omega_k \simeq |k|$, i.e. the relativistic dispersion of the photon field. Also
notice that, due to the equality of the Hamiltonians (1.34) and (1.39), all that has been
said about the behavior of the phonon modes of the atomic chain carries over to the
photon modes of the waveguide.



20 Technically, the only difference is that instead of index pairs $(k, -k)$ all indices $(k, k)$ are equal and positive.
This can be traced back to the fact that we have expanded in terms of the real eigenfunctions of the closed 
waveguide instead of the complex eigenfunctions of the circular oscillator chain. 







> As with their phononic analogue, the oscillators described by Eq. (1.39) exhibit zero-point 
fluctuations. It is a fascinating aspect of quantum electrodynamics that these oscilla- 
tions, caused by quantization of the most relativistic field, surface at various points of 
non-relativistic physics. In the section below two prominent manifestations of zero-point 
fluctuations in condensed matter physics will be briefly discussed. 
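The crossover of the waveguide dispersion to the relativistic form can be made concrete with a few numbers. This is a minimal sketch in units where the speed of light is one; the function name `omega` and the sample values of $k$ and $L_y$ are illustrative assumptions, not taken from the text.

```python
import math


def omega(k, L_y):
    """Dispersion of the lowest waveguide mode, omega_k = sqrt(k^2 + (pi/L_y)^2)."""
    return math.sqrt(k**2 + (math.pi / L_y)**2)


L_y = 1.0  # hypothetical transverse width
for k in (0.0, 1.0, 10.0, 1000.0):
    w = omega(k, L_y)
    ratio = w / k if k > 0 else float("inf")
    print(f"k = {k:8.1f}   omega = {w:12.6f}   omega/|k| = {ratio:.6f}")
```

At $k = 0$ the frequency does not vanish but equals $\pi/L_y$, the gap set by the transverse confinement. For $k \gg L_y^{-1}$ the ratio $\omega_k/|k|$ tends to one.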



Vacuum fluctuations in matter 

The quantum oscillation of the electromagnetic field manifests itself in a multitude of physi- 
cal phenomena. In the following, we shall briefly discuss some of the more prominent effects 
caused by the dynamics of the quantum vacuum. 

One of the most important phenomena induced by vacuum fluctuations is the Casimir 
effect : 21 two parallel conducting plates embedded into the vacuum exert an attractive 
force on each other. This phenomenon is not only of conceptual importance - better than 
anything else it demonstrates that the vacuum is “alive” - but also of applied relevance. 
For example, the force balance of hydrophobic suspensions of particles of size 0.1-1 μm in
electrolytes is believed to be strongly influenced by Casimir forces. At least it is applica- 
tions in colloidal chemistry which prompted Casimir to his famous analysis of the idealized 
vacuum problem. Qualitatively, the origin of the Casimir force is readily understood. Like 
their classical analog, quantum photons exert a certain radiation pressure on macroscopic 
media. The difference with the classical case is that, due to zero-point oscillations, even the 
quantum vacuum is capable of creating radiation pressure. For a single conducting body 
embedded into the infinite vacuum, the net pressure vanishes by symmetry. However, for 
two parallel plates, the situation is different. Mode quantization arguments similar to the 
ones used in the previous section show that the density of quantum modes between the 
plates is lower than in the semi-infinite outer spaces. Hence, the force (density) created by 
outer space exceeds the counter-pressure from the inside; the plates “attract” each other. 
The unambiguous measurement of the Casimir force required advanced nano-technological 
instrumentation and has succeeded only recently. 22 

A second important phenomenon where vacuum fluctuations play a role is van der Waals
forces: atoms or molecules attract each other by a potential that, at small separation $r$,
scales as $r^{-6}$. Early attempts to explain the phenomenon classically produced results for the
force that, in conflict with experimental findings, were strongly temperature dependent. It
was considered a major breakthrough of the new quantum mechanics when London proposed
a model whereby a temperature-independent $r^{-6}$ law was obtained. The essence of London’s



21 H. B. G. Casimir and D. Polder, The influence of retardation on the London— van der Waals forces, Phys. Rev. 
73 , 360—72 (1948); H. B. G. Casimir, On the attraction between two perfectly conducting plates, Proc. Kon. 
Nederland. Akad. Wetensch. 51 , 793—6 (1948). 

22 S. K. Lamoreaux, Demonstration of the Casimir force in the 0.6 to 6 μm range, Phys. Rev. Lett. 78, 5-9 (1997);
G. Bressi et al., Measurement of the Casimir force between parallel metallic surfaces, Phys. Rev. Lett. 88, 
41804—9 (2002). For a review on recent developments see M. Bordag et al ., New developments in the Casimir 
effect, Phys. Rep. 353 , 1-205 (2001). 








Figure 1.10 On the creation of van der Waals forces by EM vacuum fluctuations. An atom or 
molecule B is exposed to fluctuations of the EM quantum vacuum (depicted by wavy lines). This 
leads to the creation of a dipole element $\mathbf{d}_B$. A nearby partner atom A sees both the vacuum
amplitude and the dipole field created by $\mathbf{d}_B$. The cooperation of the two field strengths leads to
a net lowering of the energy of A. 

idea is easily explained: imagine each of the atoms/molecules as an oscillator. The zero- 
point motion of these oscillators, measured in terms of some coordinate x, creates a dipole 
moment of strength ~ x. At close distances, the systems interact through a dipole-dipole 
interaction. 23 Writing down coupled oscillator equations (see Problem 1.8), one readily 
obtains a lowering of the ground state energy by a contribution $V(r) \sim r^{-6}$.
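London's argument can be illustrated with a short numerical sketch. We assume here (this is our reading of the two-oscillator model, with $\hbar = 1$) that the coupled system has potential energy $\frac{m\omega_0^2}{2}(x_1^2 + x_2^2) + mKx_1x_2$ with $K(r) \propto r^{-3}$. Passing to $x_\pm = (x_1 \pm x_2)/\sqrt{2}$ decouples the problem into two oscillators of frequency $\omega_\pm = (\omega_0^2 \pm K)^{1/2}$. The function name and parameter values below are hypothetical.

```python
import math


def ground_state_shift(r, omega0=1.0, coupling=1.0):
    """Ground-state energy of the coupled pair minus that of two free oscillators.

    `coupling` stands for the combination q e^2 / m, so K(r) = coupling / r**3.
    Requires K < omega0**2, i.e. sufficiently weak coupling.
    """
    K = coupling / r**3
    omega_plus = math.sqrt(omega0**2 + K)
    omega_minus = math.sqrt(omega0**2 - K)
    # Zero-point energies (omega_+ + omega_-)/2 versus 2 * (omega0 / 2).
    return 0.5 * (omega_plus + omega_minus) - omega0


s10 = ground_state_shift(10.0)
s20 = ground_state_shift(20.0)
slope = math.log(s20 / s10) / math.log(2.0)
print("shift at r = 10        :", s10)
print("leading-order estimate :", -(1.0 / 10.0**3) ** 2 / 8.0)
print("log-log slope          :", slope)
```

The shift is negative and agrees with the leading-order expansion $-K^2/(8\omega_0^3)$. Since $K \sim r^{-3}$, the log-log slope comes out close to $-6$.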

However, there is a subtle point 
about the seemingly innocuous cou- 
pling of the quantum oscillator to the 
classical Coulomb potential. Close 
inspection (see Milonni’s text for 
details 24 ) shows that, to maintain 
the quantum commutation relations 
of the oscillator degrees of freedom, 
the EM field must not be assumed 
entirely classical. In fact, once the 
quantum nature of the EM field has 
entered the stage, it becomes possible to explain the force without need for phenomenolog- 
ical introduction of atomic oscillator degrees of freedom. 

Consider an atom A exposed to an electric field $\mathbf{E}$ (see Fig. 1.10). The field gives rise to
the atomic Stark effect, i.e. a level shift of order $W \sim \alpha\langle \mathbf{E}^2(\mathbf{x}_A)\rangle$, where $\alpha$ is the atomic
polarizability, $\mathbf{E} = \partial_t \mathbf{A}$ is the operator representing the EM field, and $\langle\ldots\rangle$ a quantum
expectation value. For an atom in empty space, $\mathbf{E}(\mathbf{x}_A) = \mathbf{E}_0(\mathbf{x}_A)$ is an operator measuring
the vacuum fluctuations of the free field. However, in the neighborhood of a second atom
B, an induced dipole contribution adds to $\mathbf{E}_0$. The point is that the zero-point amplitude
$\mathbf{E}_0$ induces a dipole element $\mathbf{d}_B \propto \alpha\mathbf{E}_0(\mathbf{x}_B)$ which, in turn, creates a dipole field $\mathbf{E}_B$



Johannes Diderik van der Waals 
1837-1923 

1910 Nobel Laureate in physics 
in recognition of his work on the 
state equation of non-ideal gases. 

Van der Waals was one of the 
first to postulate inter-molecular 
forces, at a time when the exis- 
tence of atoms and molecules was still disputed. 
Other important work includes the derivation of first 
variants of thermodynamic scaling relations. 




23 In passing, we mention that various animal species benefit from the short-distance efficiency of the van der 
Waals force. For instance, geckos and spiders owe their ability to climb up planar surfaces of basically any 
material to the presence of bushels of ultra-fine hair (about three orders of magnitude thinner than human hair) 
on their feet. The tips of these hairs come close enough to the atoms of the substrate material to make the van 
der Waals force sizeable. Impressively, this mechanism provides a force of about two orders of magnitude larger 
than that required to support a spider’s full body weight. Both spiders and geckos have to “roll” their feet off 
the surface to prevent getting stuck by the enormous power of the forces acting on their many body hairs! 

24 P. W. Milonni, The Quantum Vacuum (Academic Press, 1994).







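contributing to the total field amplitude felt by atom A: $\mathbf{E}(\mathbf{x}_A) = (\mathbf{E}_0 + \mathbf{E}_B)(\mathbf{x}_A)$ (see
Fig. 1.10). The dominant contribution to the level shift due to the proximity of B is then
given by

$$V = W - \alpha\langle \mathbf{E}_0^2(\mathbf{x}_A)\rangle \simeq \alpha\langle \mathbf{E}_0(\mathbf{x}_A)\cdot\mathbf{E}_B(\mathbf{x}_A)\rangle \sim \alpha^2\langle \mathbf{E}_0^T(\mathbf{x}_A) K(\mathbf{x}_A - \mathbf{x}_B)\mathbf{E}_0(\mathbf{x}_B)\rangle,$$ (1.40)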

where K is a matrix kernel describing the geometric details of the dipole interaction. (Here, 
we have subtracted the vacuum level shift because it represents an undetectable offset.) 
Notice that Eq. (1.40) predicts the van der Waals interaction to be proportional to the 
square of the zero-point field amplitude. 



Unfortunately, the quantitative evaluation of the matrix element, involving a geometric
average over all polarization directions of the quantum field amplitude, is somewhat
involved. As a final result of a calculation 25 one obtains

$$V(r) = -\frac{3\omega_0\alpha^2}{4r^6},$$ (1.41)



Johannes Stark 1874-1957 

Nobel laureate 1919 for his dis- 
covery of the Stark effect in 1913. 

Later, Stark became infamous for
the role he played during the
Third Reich. He attacked theo-
retical physics as “Jewish” and
stressed that scientific positions in Nazi Germany
should only be held by pure-blooded Germans.




where $r = |\mathbf{x}_A - \mathbf{x}_B|$ is the distance between the atoms and $\omega_0$ is the transition frequency
between the ground state and the first excited state of the atom. (This parameter enters 
the result through the dependence of the microscopic polarizability on the transition fre- 
quencies.) 



1.6 Noether’s theorem 

According to a basic paradigm of physics every continuous symmetry of a system entails a 
conservation law. 26 Conservation laws, in turn, greatly simplify the solution of any problem,
which is why one gets acquainted with the correspondence (symmetry ↔ conservation
law) at a very early stage of the physics curriculum, e.g. the connection between rotational
symmetry and the conservation of angular momentum. However, it is not all that trivial 
to see (at least within the framework of Newtonian mechanics) that the former entails 
the latter. One needs to know what to look for (angular momentum) to identify the con- 
served quantity corresponding to rotational invariance. It is one of the major advantages of 
Lagrangian over Newtonian mechanics that it provides one with a tool - Noether’s theorem 
- to automatically identify the conservation laws generated by the symmetries of classical 
mechanics. 



25 Detailed, e.g., in Milonni, The Quantum Vacuum , Section 3.11. 

26 Before exploring the ramifications of symmetries and conservation laws for fields, it may be instructive to 
recapitulate Noether’s theorem in the context of classical point-particle mechanics — see, e.g., L. D. Landau and 
E. M. Lifshitz, Classical Mechanics (Pergamon, 1960). 








What happens when one advances 
from point to continuum mechan- 
ics? 27 Clearly, multidimensional con- 
tinuum theories leave more room for 
the emergence of complex symme- 
tries; even more so than in classical 
mechanics, we are in need of a tool 
identifying the corresponding con- 
servation laws. Fortunately, it turns 
out that Noether’s theorem of point 
mechanics affords a more or less 
straightforward generalization to 
higher dimensions. Starting from 
the general form of the action of a 
continuum system, Eq. (1.16), the 
continuum version of Noether’s the- 
orem will be derived below. In not 
referring to a specific physical problem, our discussion will be somewhat dry. This lack 
of physical context is, however, more than outweighed by the general applicability of the 
result: the generalized form of Noether’s theorem can be - without much further thought - 
applied to generate the conservation laws of practically any physical symmetry. In this 
section, we will illustrate the application of the formalism on the simple (yet important) 
example of space-time translational invariance. A much more intriguing case study will be 
presented in Section 4.3 after some further background of quantum field theory has been 
introduced. 



Amalie E. Noether 1882-1935 

German mathematician known 
for her groundbreaking contri- 
butions to abstract algebra and 
theoretical physics. Alive at a 
time when women were not supposed to attend 
college preparatory schools, she was often forbidden 
from lecturing under her own name. Despite these 
obstacles, Noether became one of the greatest 
algebraists of the century. Described by Albert Ein- 
stein as the most significant creative mathematical 
genius thus far produced since the higher education 
of women began, she revolutionized the theories 
of rings, fields, and algebras. In physics, Noether's 
theorem explains the fundamental connection 
between symmetry and conservation laws. In 1933, 
she lost her teaching position due to being a Jew 
and a woman, and was forced out of Germany by 
the Nazis. 




Symmetry transformations 

The symmetries of a physical system manifest themselves in the invariance of its action 
under certain transformations. Mathematically, symmetry transformations are described by 
two pieces of input data: firstly, a mapping $M \to M$, $x \mapsto x'(x)$ that assigns to any point of
the base manifold some “transformed” point; secondly, the field configurations themselves
may undergo some change, i.e. there may be a mapping $(\phi : M \to T) \mapsto (\phi' : M \to T)$ that
defines transformed values $\phi'(x') = F[\{\phi(x)\}]$ in terms of the “old” field. It is important to
understand that these two operations may, in general, be independent of each other. 



27 Later on we shall see that all we are discussing here carries over to the quantum level.








For example, suppose we want to explore the invariance properties of a theory under
translations in space-time. In this case, we can consider a mapping $x' = x + a$, $a \in \mathbb{R}^m$,
$\phi'(x') = \phi(x)$. This describes the translation of a field by a fixed offset $a$ in space-time
(see figure). The system is translationally invariant if and only if $S[\phi] = S[\phi']$. As
a second example, let us probe rotational symmetry: $x' = Rx$, where $R \in O(m)$ is a rotation
of Euclidean space-time. In this case it would, in general, be unphysical to define
$\phi'(x') = \phi(x)$. To illustrate this point, consider the example of a vector field in two
dimensions $n = m = 2$ (see the figure). A properly rotated field configuration is defined by
$\phi'(x') = R\phi(x)$, i.e. the field amplitude actively participates in the operation. In fact,
one often considers symmetry operations where only the fields are transformed while the base
manifold is left untouched. 28 For example, the intrinsic 29 rotational invariance of a magnet
is probed by $x' = x$, $\mathbf{m}'(x) = R \cdot \mathbf{m}(x)$,
where the vector field $\mathbf{m}$ describes the local magnetization.

To understand the impact of a symmetry transformation, it is fully sufficient to con- 
sider its infinitesimal version. (Any finite transformation can be generated by successive 
application of infinitesimal ones.) Consider, thus, the two mappings

$$x_\mu \to x'_\mu = x_\mu + \left.\frac{\partial x_\mu}{\partial\omega_a}\right|_{\omega=0}\omega_a(x),$$

$$\phi^i(x) \to \phi'^i(x') = \phi^i(x) + \omega_a(x) F^i_a[\phi],$$ (1.42)

expressing the change of both fields and coordinates to linear order in a set of parameter
functions $\{\omega_a\}$ characterizing the transformation. (For a three-dimensional rotation,
$(\omega_1, \omega_2, \omega_3) = (\phi, \theta, \psi)$ would be the rotation angles, etc.) The functionals $\{F^i_a\}$ - which
need not depend linearly on the field and may explicitly depend on the coordinate $x$ -
define the incremental change $\phi'(x') - \phi(x)$.

We now ask how the action Eq. (1.16) changes under the transformation Eq. (1.42), i.e. 
we wish to compute the difference 



$$\Delta S = \int d^m x'\, \mathcal{L}(\phi'^i(x'), \partial_{x'_\mu}\phi'^i(x')) - \int d^m x\, \mathcal{L}(\phi^i(x), \partial_{x_\mu}\phi^i(x)).$$



28 For example, the standard symmetry transformations of classical mechanics, q(t ) —> q'(t), belong to this class: 
the coordinate vector of a point particle, q (a “field” in 0 + 1 space-time dimensions) changes while the “base”
(time t ) does not transform. 

29 “Intrinsic” means that we rotate just the spins but not the entire magnet (as in our second example). 








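Inserting Eq. (1.42), using the fact that $\partial x'_\mu/\partial x_\nu = \delta_{\mu\nu} + (\partial/\partial x_\nu)(\omega_a\, \partial x_\mu/\partial\omega_a)$, together
with 30 $|\partial x'_\mu/\partial x_\nu| \simeq 1 + (\partial/\partial x_\mu)(\omega_a\, \partial x_\mu/\partial\omega_a)$, one obtains

$$\Delta S \simeq \int d^m x\, \left(1 + \partial_{x_\mu}(\omega_a \partial_{\omega_a} x_\mu)\right) \mathcal{L}\left(\phi^i + F^i_a\omega_a,\; (\delta_{\mu\nu} - \partial_{x_\mu}(\omega_a \partial_{\omega_a} x_\nu))\,\partial_{x_\nu}(\phi^i + F^i_a\omega_a)\right)$$

$$\qquad - \int d^m x\, \mathcal{L}(\phi^i(x), \partial_{x_\mu}\phi^i(x)).$$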

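So far, we did not use the fact that the transformation was actually meant to be a symmetry
transformation. By definition, we are dealing with a symmetry if, for constant parameters
$\omega_a$ - a uniform rotation, global translation, etc. - the action difference $\Delta S$ vanishes. In
other words, the leading contribution to the action difference of a symmetry transformation
must be linear in the derivatives $\partial_{x_\mu}\omega_a$. A straightforward expansion of the formula above
for $\Delta S$ shows that these terms are given by

$$\Delta S = -\int d^m x\, j^a_\mu(x)\, \partial_\mu\omega_a,$$

where the so-called Noether current is given by

$$j^a_\mu = \left(\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i)}\partial_\nu\phi^i - \mathcal{L}\delta_{\mu\nu}\right)\left.\frac{\partial x_\nu}{\partial\omega_a}\right|_{\omega=0} - \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i)}F^i_a.$$ (1.43)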

For a completely general field configuration, there is not much that can be said about the Noether
current (whether or not the theory possesses the symmetry). However, if the field $\phi$ obeys
the classical equations of motion and the theory is symmetric, the Noether current is locally
conserved, $\partial_\mu j^a_\mu = 0$. This follows from the fact that, for a solution $\phi$ of the Euler-Lagrange
equations, the linear variation of the action in any parameter must vanish. Specifically,
the vanishing of $\Delta S[\phi]$ for arbitrary symmetry transformations $\{\omega_a\}$ enforces the condition
$\partial_\mu j^a_\mu = 0$. (But keep in mind the fact that the conservation law holds only on the classical
level!) Therefore, in summary, we have Noether’s theorem:

A continuous symmetry entails a classically conserved current. 

The local conservation of the current entails the existence of a globally conserved “charge.” 
For example, for a base manifold $x = (x_0, x_1, \ldots, x_d)$ in Euclidean $(1 + d)$-dimensional
space-time, integration over the space-like directions, and application of Stokes’ theorem,
gives $\partial_0 Q^a = 0$, where 31

$$Q^a \equiv \int d^d x\, j^a_0$$ (1.44)

is the conserved charge and we have assumed that the current density vanishes at spatial 
infinity. 

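30 This follows from $\left|\frac{\partial x'_\mu}{\partial x_\nu}\right| = \det\frac{\partial x'_\mu}{\partial x_\nu} = \exp\,\mathrm{tr}\ln\frac{\partial x'_\mu}{\partial x_\nu} \simeq \exp\left[\frac{\partial}{\partial x_\mu}\left(\omega_a\frac{\partial x_\mu}{\partial\omega_a}\right)\right] \simeq 1 + \frac{\partial}{\partial x_\mu}\left(\omega_a\frac{\partial x_\mu}{\partial\omega_a}\right)$.

(Exercise: Show that $\det A = \exp\,\mathrm{tr}\ln A$, where $A$ is a linear operator. Hint: Use the eigenbasis.)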

31 Notice that the integral involved in the definition of Q runs only over spatial coordinates. 
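The identity $\det A = \exp\,\mathrm{tr}\ln A$ used in footnote 30, and its first-order consequence $\det(1 + X) \simeq 1 + \mathrm{tr}\,X$, can be checked numerically for a matrix close to the identity. The sketch below is only an illustration: the matrix `M`, the small parameter `eps`, and the helper names are arbitrary choices, and $\mathrm{tr}\ln(1 + X)$ is evaluated from its power series, which converges only for small $X$.

```python
import math


def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][l] * b[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]


def trace(a):
    return sum(a[i][i] for i in range(len(a)))


def det3(a):
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))


def trace_log_one_plus(x, terms=60):
    """tr ln(1 + X) from the series sum_n (-1)^(n+1) tr(X^n) / n (small X only)."""
    total, power = 0.0, x
    for n in range(1, terms + 1):
        total += (-1) ** (n + 1) * trace(power) / n
        power = matmul(power, x)
    return total


# Arbitrary illustrative matrix and small parameter.
M = [[0.3, -1.2, 0.5], [0.7, 0.1, -0.4], [-0.6, 0.9, 0.2]]
eps = 0.05
X = [[eps * M[i][j] for j in range(3)] for i in range(3)]
A = [[(1.0 if i == j else 0.0) + X[i][j] for j in range(3)] for i in range(3)]

det_A = det3(A)
print("det A           :", det_A)
print("exp tr ln A     :", math.exp(trace_log_one_plus(X)))
print("1 + tr X        :", 1.0 + trace(X))
```

The first two numbers agree to machine precision, while $1 + \mathrm{tr}\,X$ differs from them only at order $\epsilon^2$, which is all that is needed in the linearized action difference.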










Notice that nowhere in the discussion above have we made assumptions about the internal 
structure of the Lagrangian. In particular, all results apply equally to the Minkowskian and 
the Euclidean formulation of the theory. 



Example: translational invariance 



As an example that is as elementary as it is important, we consider the case of space-time
translational invariance. The corresponding symmetry transformation is defined by
$x'^\mu = x^\mu + a^\mu$, $\phi'(x') = \phi(x)$. The infinitesimal version of this transformation reads as
$x^\mu \to x^\mu + \omega^\mu$, where we have identified the parameter index $a$ with the space-time index $\mu$.
Noether’s current, which in the case of translational invariance is called the energy-momentum
tensor, is given by

$$T^\mu{}_\nu(x) = \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi^i)}\partial_\nu\phi^i - \delta^\mu_\nu\mathcal{L}.$$

The conserved “charges” corresponding to this quantity are

$$P^\nu \equiv \int d^d x\, \left(\frac{\partial\mathcal{L}}{\partial(\partial_0\phi^i)}\partial_\nu\phi^i - \delta^0_\nu\mathcal{L}\right).$$

Specifically, $P^0$ is the energy and $P^l$, $l = 1, \ldots, d$ the total momentum carried by the system.
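As a concrete check of these conservation laws, consider a single scalar field in 1+1 dimensions with the assumed Lagrangian density $\mathcal{L} = \frac{1}{2}(\partial_t\phi)^2 - \frac{1}{2}(\partial_x\phi)^2$ (unit wave speed; this model is our choice for illustration). The densities $T^0{}_\nu = \frac{\partial\mathcal{L}}{\partial(\partial_0\phi)}\partial_\nu\phi - \delta^0_\nu\mathcal{L}$ then become $\frac{1}{2}(\phi_t^2 + \phi_x^2)$ for $\nu = 0$ and $\phi_t\phi_x$ for $\nu = 1$. The sketch below evaluates their spatial integrals on the exact solution $\phi = f(x - t) + g(x + t)$ at two different times; the Gaussian profiles and all names are hypothetical.

```python
import math


def f_prime(s):
    """Derivative of the right-moving profile f(s) = exp(-s^2)."""
    return -2.0 * s * math.exp(-s * s)


def g_prime(s):
    """Derivative of the left-moving profile g(s) = 0.5 * exp(-(s - 1)^2)."""
    return -(s - 1.0) * math.exp(-(s - 1.0) ** 2)


def charges(t, half_width=30.0, n=6000):
    """Trapezoidal integrals of T^0_0 and T^0_1 over x at fixed time t."""
    dx = 2.0 * half_width / n
    energy = 0.0
    momentum = 0.0
    for j in range(n + 1):
        x = -half_width + j * dx
        phi_t = -f_prime(x - t) + g_prime(x + t)   # time derivative of phi
        phi_x = f_prime(x - t) + g_prime(x + t)    # space derivative of phi
        w = 0.5 if j in (0, n) else 1.0
        energy += w * 0.5 * (phi_t ** 2 + phi_x ** 2)
        momentum += w * phi_t * phi_x
    return energy * dx, momentum * dx


e0, p0 = charges(0.0)
e3, p3 = charges(3.0)
print("energy   at t = 0, 3:", e0, e3)
print("momentum at t = 0, 3:", p0, p3)
```

Both integrals are the same at the two times, although the field configuration itself has changed: the two wave packets have moved apart. The sign of the second charge depends on the index convention adopted for $P^\nu$.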



EXERCISE Consider the one-dimensional elastic chain discussed in Section 1.1. Convince yourself
that the continuity equation of the system assumes the form $\partial_t \int_{x_1}^{x_2} dx\, \rho(x, t) = \partial_t(\phi(x_2, t) - \phi(x_1, t))$,
where $\rho(x, t)$ is the local density of the medium. Use this result to show that $\rho = \partial_x\phi$.
Show that the momentum $\int dx\,$(particle density) $\times$ (velocity) $= \int dx\, \rho\,\partial_t\phi$ carried by the system
coincides with the Noether momentum.



EXERCISE Compute the energy-momentum tensor for the Lagrangian of the free electromag- 
netic field. Derive the corresponding energy-momentum vector and convince yourself that you 
obtain results familiar from electrodynamics. 

Admittedly, translational invariance is not a particularly exciting symmetry and, if this 
were the only example of relevance, the general discussion above would not have been worth 
the effort. However, later in the text, we shall exemplify (see, for example, Section 4.3) how 
concepts hinging on Noether’s theorem can be applied to obtain far-reaching results that 
are not readily accessible by different means. 



1.7 Summary and outlook 

In this chapter we have introduced the general procedure whereby classical continuum 
theories are quantized. Employing the elementary harmonic oscillator as an example, we have
seen that the Hilbert spaces of these theories afford different interpretations. Of particular 
use was a quasi-particle picture whereby the collective excitations of the continuum theories 
acquired the status of elementary particles. 







Suspiciously, both examples discussed in this text, the harmonic quantum chain and free 
quantum electrodynamics, led to exactly solvable free field theories. However, it takes 
only little imagination to foresee that only few continuum theories will be as simple. Indeed, 
the exact solvability of the atomic chain would have been lost had we included higher-order 
contributions in the expansion in powers of the lattice displacement. Such terms would 
hinder the free wave-like propagation of the phonon modes. Put differently, phonons would 
begin to scatter off each other (i.e. interact). Similarly, the free status of electrodynamics is 
lost once the EM field is allowed to interact with a matter field. Needless to say, interacting 
field theories are infinitely more complex, but also more interesting, than the systems 
considered so far. 

Technically, we have seen that the phonon/photon interpretation of the field theories 
discussed in this chapter could conveniently be formulated in terms of ladder operators. 
However, the applications discussed so far provide only a glimpse of the advantages of 
this language. In fact, the formalism of ladder operators, commonly described as “second 
quantization,” represents a central, and historically the oldest, element of quantum field 
theory. The next chapter is devoted to a more comprehensive discussion of both the formal 
aspects and applications of this formulation. 



1.8 Problems 

Electrodynamics from a variational principle 

Choosing the Lorentz-gauged components of the vector potential as generalized coordinates, the aim 
of this problem is to show how the wave equations of electrodynamics can be obtained as a variational 
principle. 

Electrodynamics can be described by Maxwell’s equations or, equivalently, by wave-like
equations for the vector potential. Working in the Lorentz gauge, $\partial_t\phi = \nabla\cdot\mathbf{A}$, these
equations read $(\partial_t^2 - \nabla^2)\phi = \rho$, $(\partial_t^2 - \nabla^2)\mathbf{A} = \mathbf{j}$. Using relativistically covariant notation,
the form of the equations can be compressed further to $\partial_\mu\partial^\mu A^\nu = j^\nu$. Starting from the
Lagrangian action, $S[A] = -\int d^4x\, (\frac{1}{4}F_{\mu\nu}F^{\mu\nu} + A_\mu j^\mu)$, obtain these equations by applying
the variational principle. Compare the Lorentz gauge representation of the action of the
field with the action of the elastic chain. What are the differences/parallels?

Answer: 

Using the definition of the electromagnetic field tensor $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$, and integrating
by parts, the action assumes the form

$$S[A] = -\int d^4x\, \left(-\frac{1}{2}A_\nu\left[\partial_\mu\partial^\mu A^\nu - \partial_\mu\partial^\nu A^\mu\right] + j_\mu A^\mu\right).$$

Owing to the Lorentz gauge condition, the second contribution in the square brackets vanishes
and we obtain $S[A] = -\int d^4x\, (\frac{1}{2}\partial_\mu A_\nu \partial^\mu A^\nu + j_\mu A^\mu)$, where we have again integrated
by parts. Applying the general variational Eq. (1.17) one finally obtains the wave equation.





Hamiltonian of electromagnetic field 



Here it is shown that the Hamiltonian canonically conjugate to the Lagrangian of the electromagnetic 
field indeed coincides with the energy density familiar from elementary electrodynamics. 

Consider the electromagnetic field in the absence of matter, j = 0. Verify that the total 
energy stored in the field is given by $H = \int d^3x\, \mathcal{H}(\mathbf{x})$ where $\mathcal{H}(\mathbf{x}) = [\mathbf{E}^2(\mathbf{x}) + \mathbf{B}^2(\mathbf{x})]/2$ is the
familiar expression for the EM energy density. (Hint: Use the vacuum form of Maxwell’s 
equations and the fact that, for an infinite system, the energy is defined only up to surface 
terms.) 

Answer: 

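Following the canonical prescription, let us first consider the Lagrangian density,

$$\mathcal{L} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} = \frac{1}{2}\sum_{i=1}^{3}(\partial_0 A_i - \partial_i A_0)(\partial_0 A_i - \partial_i A_0) - \frac{1}{4}\sum_{i,j=1}^{3}(\partial_i A_j - \partial_j A_i)(\partial_i A_j - \partial_j A_i).$$

We next determine the components of the canonical momentum through the relation
$\pi_\mu = \partial_{\partial_0 A_\mu}\mathcal{L}$: $\pi_0 = 0$, $\pi_i = \partial_0 A_i - \partial_i A_0 = E_i$. Using the fact that $\partial_i A_j - \partial_j A_i$ is a component
of the magnetic field, the Hamiltonian density can now be written as

$$\mathcal{H} = \pi_\mu\partial_0 A_\mu - \mathcal{L} = \frac{1}{2}(2\mathbf{E}\cdot\partial_0\mathbf{A} - \mathbf{E}^2 + \mathbf{B}^2) \overset{(1)}{=} \frac{1}{2}(2\mathbf{E}\cdot\nabla\phi + \mathbf{E}^2 + \mathbf{B}^2) \overset{(2)}{=} \frac{1}{2}(2\nabla\cdot(\mathbf{E}\phi) + \mathbf{E}^2 + \mathbf{B}^2),$$

where (1) is based on addition and subtraction of a term $2\mathbf{E}\cdot\nabla\phi$, and (2) on $\nabla\cdot\mathbf{E} = 0$
combined with the identity $\nabla\cdot(\mathbf{a}f) = (\nabla\cdot\mathbf{a})f + \mathbf{a}\cdot\nabla f$ (valid for general vector [scalar]
functions $\mathbf{a}$ ($f$)). Substitution of this expression into the definition of the Hamiltonian yields

$$H = \frac{1}{2}\int d^3x\, \left(2\nabla\cdot(\mathbf{E}\phi) + \mathbf{E}^2 + \mathbf{B}^2\right) = \frac{1}{2}\int d^3x\, \left(\mathbf{E}^2 + \mathbf{B}^2\right),$$

where we have used the fact that the contribution $\nabla\cdot(\mathbf{E}\phi)$ is a surface term that vanishes
upon integration by parts.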



Phonon specific heat 

In the text, we have stated that the mode quantization of elastic media manifests itself in low- 
temperature anomalies of the specific heat. In this problem, concepts of elementary quantum statistical 
mechanics are applied to determine the temperature profile of the specific heat. 

Compute the energy density $u = -(1/L)\partial_\beta\ln Z$ of one-dimensional longitudinal phonons
with dispersion $\omega_k = v|k|$, where $Z = \mathrm{tr}\, e^{-\beta\hat H}$ denotes the quantum partition function. As
an intermediate result, show that the thermal expectation value of the energy density of
the system can be represented as

$$u = \frac{1}{L}\sum_k\left(\frac{\omega_k}{2} + \omega_k n_B(\omega_k)\right),$$ (1.45)







where $n_B(\epsilon) = (e^{\beta\epsilon} - 1)^{-1}$ is the Bose-Einstein distribution function. Approximate the sum
over $k$ by an integral and show that $c_v \sim T$. At what temperature $T_{\mathrm{cl}}$ does the specific heat
cross over to the classical result $c_v = \mathrm{const}$? (Remember that the linear dispersion $\omega_k = v|k|$
is based on a quadratic approximation to the Hamiltonian of the system and, therefore,
holds only for $|k| < \Lambda$, where $\Lambda$ is some cutoff momentum.) Reiterate the discussion of
Section 1.4 for a d-dimensional isotropic solid of volume $L^d$ (i.e. assume that the atomic
exchange constants remain the same in all directions). Show that the dispersion generalizes
to $\omega_{\mathbf{k}} = v|\mathbf{k}|$, where $\mathbf{k} = (n_1, \ldots, n_d)2\pi/L$ and $n_i \in \mathbb{Z}$. Show that the specific heat shows
the temperature dependence $c_v \sim T^d$.



Answer: 

As discussed in the text, the quantum eigenstates of the system are given by |ni,ri 2 , . . .), 
where n m is the number of phonons of wavenumber k m = 2-Km/L, E | ni)Tl2 ,...) = 

Em w t m (n m + 1/2) = X) m C m the eigenenergy, and w m = v\k m \. In the energy represen- 
tation, the quantum partition function then takes the form 



Z = tre-^= 



D -/3w m (n m + l/2) 



n 



e -/3w m /2 

— g 5 



where n m is the occupation number of the state with wavenumber k m . Hence, 



lnZ = -^[/3^+ln(l-e-^)' . 

m 



Differentiation with respect to (3 yields Eq. (1.45) and, replacing J2 m {L/Zn) J dk, we 
arrive at 



U ~ Cl + 2^. 



e pv\k\ _ l 



Here, $C_1$ is the temperature-independent constant accounting for the “zero-point energies”
$\omega_m/2$. In the second equality, we have scaled $k \to \beta k$. This produces a prefactor $\beta^{-2}$
multiplied by a temperature-independent (up to the temperature dependence of the boundaries
$\Lambda \to \beta\Lambda$) integral that we denoted by $C_2$. Differentiation with respect to $T$ then leads to the
relation $c_v = \partial_T u \sim T$. However, for temperatures $T > v\Lambda$ higher than the highest
frequencies stored in the phonon modes, the procedure above no longer makes sense (formally, due
to the now non-negligible temperature dependence of the boundaries). Yet in this regime
we may expand $e^{\beta v|k|} - 1 \simeq \beta v|k|$, which brings us back to the classical result $c_v = \text{const}$.

Consider now a $d$-dimensional solid with “isotropic” exchange coupling
$(k_s/2)\sum_{i=1}^{d}(\phi_{\mathbf R + \mathbf e_i} - \phi_{\mathbf R})^2$ ($\mathbf e_i$: unit vector in the $i$-direction). Taking a continuum limit leads to a
contribution $(k_s a^2/2)(\nabla\phi(\mathbf x))^2$. Proceeding along the lines of the text, we find that the relevant
excitations are now waves with wave vector $\mathbf k = 2\pi(n_1, \ldots, n_d)/L$ and energy $\omega_{\mathbf k} = |\mathbf k| v$.
Accordingly, the sum in our formula for the specific heat becomes $\sum_{\mathbf k} \sim \int d^dk$. Scaling
$k_i \to \beta k_i$ then generates a prefactor $\beta^{-d-1}$ and we arrive at the relation $c_v \sim T^d$.
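The crossover between the two regimes can be checked numerically. The following is a minimal sketch, assuming units with $\hbar = k_{\rm B} = 1$ and using $c_v = \partial_T u = \int_{-\Lambda}^{\Lambda} \frac{dk}{2\pi}\, \frac{x^2 e^x}{(e^x - 1)^2}$ with $x = \beta v|k|$, which follows from differentiating the integrand of $u$ with respect to $T$. The function name `specific_heat_1d`, the grid size, and the parameter values are illustrative choices, not part of the text.

```python
import numpy as np

def specific_heat_1d(T, v=1.0, cutoff=1.0, n_grid=200_000):
    """c_v = du/dT for 1D phonons with omega_k = v|k| and |k| < cutoff.

    Assumes hbar = k_B = 1. The k-integral is evaluated with a midpoint
    rule on (0, cutoff) and doubled to account for k < 0.
    """
    dk = cutoff / n_grid
    k = (np.arange(n_grid) + 0.5) * dk
    x = v * k / T  # x = beta * omega_k
    # x^2 e^x / (e^x - 1)^2, rewritten in an overflow-safe form
    integrand = x**2 * np.exp(-x) / np.expm1(-x) ** 2
    return 2.0 * np.sum(integrand) * dk / (2.0 * np.pi)

low_T = specific_heat_1d(0.01)    # T << v * cutoff: expect c_v ~ pi T / (3 v)
high_T = specific_heat_1d(100.0)  # T >> v * cutoff: expect c_v ~ cutoff / pi
print(low_T / (np.pi * 0.01 / 3.0))  # ratio close to 1
print(high_T * np.pi)                # ratio close to 1
```

In this sketch the low-temperature value is linear in $T$, while the high-temperature value saturates at a constant set by the number of modes below the cutoff, in line with a crossover scale of order $v\Lambda$.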





Van der Waals force 



We here discuss a semi-phenomenological approach to the van der Waals force: modelling the con- 
stituents participating in the interaction as quantum oscillators, we find that the ground state energy 
of the ensuing two-body system is lower than that of the isolated compounds. 



London’s phenomenological approach to explaining the van der Waals force starts out
from the Hamiltonian

$$\hat H = \frac{\hat p_1^2}{2m} + \frac{\hat p_2^2}{2m} + \frac{m\omega_0^2}{2}\left(\hat x_1^2 + \hat x_2^2\right) + mK\hat x_1\hat x_2,$$

where $\hat x_i$ is the coordinate of an oscillator phenomenologically representing one of two
atoms $i = 1, 2$. The atoms are coupled through a dipole-dipole interaction, where the
dipole operators are proportional to $\hat x_i$ and $K(r) = qe^2/(mr^3)$ encapsulates the details of
the interaction. Here $r$ denotes the distance between the atoms and $q$ is a dipole-dipole
orientation factor.

Compute the spectrum of the two-particle system described by $\hat H$ and verify that the
dipole coupling leads to a lowering of the ground state energy by an amount $V(r) = K^2/8\omega_0^3$.
Using the fact that the classical polarizability of an oscillator is given by $\alpha = e^2/m\omega_0^2$ this
becomes $V = q^2\omega_0\alpha^2/8r^6$. Finally, using the fact that the directional average of the
geometric factor $q^2$ equals 2, and multiplying by three to take account of the three-dimensional
character of a “real” atomic oscillator, we recover the result Eq. (1.41).



Answer: 

We begin by formulating the potential $U$ of the oscillator system in a matrix notation,
$U = x^T A x$, where $x^T = (x_1, x_2)$ and

$$A = \frac{m}{2}\begin{pmatrix} \omega_0^2 & K \\ K & \omega_0^2 \end{pmatrix}.$$

Diagonalizing the matrix, $A = UDU^T$, and transforming coordinates, $x \to U^T x = x'$, the
system decouples into two independent oscillators,

$$\hat H = \sum_{i=1,2}\left[\frac{\hat p_i'^2}{2m} + \frac{m\omega_i^2}{2}\hat x_i'^2\right],$$

where the new characteristic frequencies $\omega_{1/2} = (\omega_0^2 \pm K)^{1/2}$ are determined by the
eigenvalues of the matrix $A$. The ground state energy of this system is given by
$E_0 = (\omega_1 + \omega_2)/2 \approx \omega_0 - K^2/8\omega_0^3$, which lies by an amount $V = K^2/(8\omega_0^3)$ lower than the energy of the
isolated atoms.
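The expansion of the ground state energy can be verified numerically. The sketch below assumes $\hbar = 1$ and diagonalizes the matrix $A = \frac{m}{2}\begin{pmatrix}\omega_0^2 & K\\ K & \omega_0^2\end{pmatrix}$ implied by the potential $U$; the function name `ground_state_shift` and the parameter values are illustrative.

```python
import numpy as np

def ground_state_shift(omega0, K, m=1.0):
    """Lowering of the ground state energy of two coupled oscillators (hbar = 1).

    The potential is U = x^T A x with A = (m/2) [[omega0^2, K], [K, omega0^2]].
    The eigenvalues of A equal m * omega_i^2 / 2 for the decoupled oscillators.
    """
    A = 0.5 * m * np.array([[omega0**2, K], [K, omega0**2]])
    eigenvalues = np.linalg.eigvalsh(A)
    omegas = np.sqrt(2.0 * eigenvalues / m)  # omega_{1/2} = sqrt(omega0^2 -/+ K)
    E0 = 0.5 * omegas.sum()                  # zero-point energy of the coupled pair
    return omega0 - E0                       # isolated atoms: 2 * omega0 / 2

omega0, K = 1.0, 0.01
print(ground_state_shift(omega0, K), K**2 / (8.0 * omega0**3))
```

For weak coupling $K \ll \omega_0^2$, the two printed numbers should agree up to corrections of relative order $K^2/\omega_0^4$.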




2 

Second quantization 



The purpose of this chapter is to introduce and apply the method of second quantization, a tech- 
nique that underpins the formulation of quantum many-particle theories. The first part of the chapter 
focuses on methodology and notation, while the remainder is devoted to the development of applications 
designed to engender familiarity with and fluency in the technique. Specifically, we will investigate the 
physics of the interacting electron gas, charge density wave propagation in one-dimensional quantum 
wires, and spin waves in a quantum Heisenberg (anti)ferromagnet. Indeed, many of these examples and 
their descendants will reappear as applications in our discussion of methods of quantum field theory in 
subsequent chapters. 



In the previous chapter we encountered two field theories that could conveniently be rep- 
resented in the language of “second quantization,” i.e. a formulation based on the algebra 
of certain ladder operators $a_k$.$^1$ There were two remarkable facts about this formulation:
firstly, second quantization provides a compact way of representing the many-body space 
of excitations; secondly, the properties of the ladder operators were encoded in a simple 
set of commutation relations (cf. Eq. (1.33)) rather than in some explicit Hilbert space 
representation. 

Apart from a certain aesthetic appeal, these observations would not be of much relevance 
if it were not for the fact that the formulation can be generalized to a comprehensive and 
highly efficient formulation of many-body quantum mechanics in general. In fact, second 
quantization can be considered the first major cornerstone on which the theoretical frame- 
work of quantum field theory was built. This being so, extensive introductions to the concept 
can be found throughout the literature. We will therefore not develop the formalism in full 
mathematical rigor but rather proceed pragmatically by first motivating and introducing 
its basic elements, followed by a discussion of the “second quantized” version of standard 
operations of quantum mechanics (taking matrix elements, changing bases, representing 
operators, etc.). The second part of the chapter is concerned with developing fluency in the 
method by addressing a number of applications. Readers familiar with the formalism may 
therefore proceed directly to these sections. 



1 The term “second quantization” is unfortunate. Historically, this terminology was motivated by the observation 
that the ladder operator algebra fosters an interpretation of quantum excitations as discrete “quantized” units. 
Fundamentally, however, there is nothing like “two” superimposed quantization steps in single- or many-particle 
quantum mechanics. Rather, one is dealing with a particular representation of the “first and only quantized” 
theory tailored to the particular problem at hand. 






2.1 Introduction to second quantization 
Motivation 



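We begin our discussion by recapitulating some fundamental notions of many-body quantum
mechanics, as formulated in the traditional language of symmetrized/anti-symmetrized
wavefunctions. Consider the (normalized) set of wavefunctions $|\lambda\rangle$ of some single-particle
Hamiltonian $\hat H$: $\hat H|\lambda\rangle = \epsilon_\lambda|\lambda\rangle$, where $\epsilon_\lambda$ are the eigenvalues. With this definition,
the normalized two-particle wavefunction $\psi_F$ ($\psi_B$) of two fermions (bosons) populating
levels $\lambda_1$ and $\lambda_2$ is given by the anti-symmetrized (symmetrized) product

$$\psi_F(x_1, x_2) = \frac{1}{\sqrt 2}\left(\langle x_1|\lambda_1\rangle\langle x_2|\lambda_2\rangle - \langle x_1|\lambda_2\rangle\langle x_2|\lambda_1\rangle\right),$$
$$\psi_B(x_1, x_2) = \frac{1}{\sqrt 2}\left(\langle x_1|\lambda_1\rangle\langle x_2|\lambda_2\rangle + \langle x_1|\lambda_2\rangle\langle x_2|\lambda_1\rangle\right).$$

[Figure: single-particle levels $\epsilon_0, \ldots, \epsilon_4$ with their occupation numbers $n_\lambda$, shown for bosons and for fermions.]

In the Dirac bracket representation, the two-body states $|\lambda_1, \lambda_2\rangle_{F(B)}$ corresponding to the
wave functions $\psi_{F(B)}(x_1, x_2) = (\langle x_1| \otimes \langle x_2|)\,|\lambda_1, \lambda_2\rangle_{F(B)}$ above can be presented as

$$|\lambda_1, \lambda_2\rangle_{F(B)} \equiv \frac{1}{\sqrt 2}\left(|\lambda_1\rangle \otimes |\lambda_2\rangle + \zeta|\lambda_2\rangle \otimes |\lambda_1\rangle\right),$$

where $\zeta = -1$ for fermions while $\zeta = 1$ for bosons.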



Note that the explicit symmetrization of the wavefunctions is necessitated by quantum
mechanical indistinguishability: for fermions (bosons) the wave function has to be
anti-symmetric (symmetric) under particle exchange.$^2$ More generally, an appropriately
symmetrized $N$-particle wavefunction can be expressed in the form



Enrico Fermi 1901-54 

Nobel Laureate in Physics in 1938 
for his demonstrations of the exis- 
tence of new radioactive elements 
produced by neutron irradiation, 
and for his related discovery of 
nuclear reactions brought about 
by slow neutrons. (Image © The 
Nobel Foundation.) 




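$$|\lambda_1, \lambda_2, \ldots, \lambda_N\rangle \equiv \frac{1}{\sqrt{N!\prod_{\lambda=0}^{\infty}(n_\lambda!)}} \sum_{\mathcal P} \zeta^{(1 - \mathrm{sgn}\,\mathcal P)/2}\, |\lambda_{\mathcal P 1}\rangle \otimes |\lambda_{\mathcal P 2}\rangle \otimes \cdots \otimes |\lambda_{\mathcal P N}\rangle, \qquad (2.1)$$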



2 Notice, however, that in two-dimensional systems the standard doctrine of fully symmetric/anti-symmetric many
particle wave functions is too narrow and more general types of exchange statistics can be realized, cf. our
discussion on page 41.






where $n_\lambda$ represents the total number of particles in state $\lambda$ (for fermions, Pauli exclusion
enforces the constraint $n_\lambda \leq 1$) - see the schematic figure above. The summation runs over
all $N!$ permutations of the set of quantum numbers $\{\lambda_1, \ldots, \lambda_N\}$, and $\mathrm{sgn}\,\mathcal P$ denotes the
sign of the permutation $\mathcal P$. ($\mathrm{sgn}\,\mathcal P = 1$ $[-1]$ if the number of transpositions of two elements
which brings the permutation $(\mathcal P_1, \mathcal P_2, \ldots, \mathcal P_N)$ back to its original form $(1, 2, \ldots, N)$ is
even [odd].) The prefactor $1/\sqrt{N!\prod_\lambda (n_\lambda!)}$ normalizes the many-body wavefunction. In
the fermionic case, wave functions corresponding to the states above are known as Slater
determinants.
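To make the normalization in Eq. (2.1) concrete, the following minimal sketch builds the (anti-)symmetrized state as an explicit tensor for a small single-particle space. It assumes that quantum numbers are labelled by integers `0, ..., dim - 1`; the helper names `perm_sign` and `many_body_state` are hypothetical.

```python
import itertools
import math
from collections import Counter

import numpy as np

def perm_sign(perm):
    """Sign of a permutation given as a sequence of 0..N-1 (counts transpositions)."""
    perm = list(perm)
    sign = 1
    for i in range(len(perm)):
        while perm[i] != i:
            j = perm[i]
            perm[i], perm[j] = perm[j], perm[i]
            sign = -sign
    return sign

def many_body_state(lams, zeta, dim):
    """Tensor of the state |lam_1, ..., lam_N> of Eq. (2.1).

    zeta = +1 for bosons, -1 for fermions. Entry [i_1, ..., i_N] is the
    amplitude of the product state |i_1> x ... x |i_N>.
    """
    N = len(lams)
    occupations = Counter(lams).values()
    norm = math.sqrt(math.factorial(N) * math.prod(math.factorial(n) for n in occupations))
    state = np.zeros((dim,) * N)
    for perm in itertools.permutations(range(N)):
        # zeta^{(1 - sgn P)/2} equals sgn P for fermions and 1 for bosons
        coeff = 1.0 if zeta == 1 else float(perm_sign(perm))
        state[tuple(lams[p] for p in perm)] += coeff
    return state / norm

bosons = many_body_state((0, 0, 1), zeta=1, dim=3)
fermions = many_body_state((0, 1, 2), zeta=-1, dim=3)
print(np.sum(bosons**2), np.sum(fermions**2))  # both norms should be 1 up to rounding
```

Requesting a fermionic state with a doubly occupied level, such as `many_body_state((0, 0, 1), -1, 3)`, returns the zero tensor, which is how Pauli exclusion shows up in this representation.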

Finally, notice that it will be useful to assume that the quantum numbers $\{\lambda_i\}$ defining the
state $|\lambda_1, \lambda_2, \ldots, \lambda_N\rangle$ are ordered according to some convention. (For example, for $\lambda_i = x_i$
a one-dimensional coordinate representation, we might order according to the rule
$x_1 < x_2 < \cdots < x_N$.) Once an ordered sequence of states has been fixed we may - for
notational convenience - label our quantum states by integers, $\lambda = 1, 2, \ldots$. Any initially
non-ordered state (e.g. $|2, 1, 3\rangle$) can be brought into an ordered form ($|1, 2, 3\rangle$) at the cost
of, at most, a change of sign.

INFO For the sake of completeness, let us spell out the connection between the permutation
group and many-body quantum mechanics in a more mathematical language. The basic
arena wherein $N$-body quantum mechanics takes place is the product space,

$$\mathcal H^N \equiv \underbrace{\mathcal H \otimes \cdots \otimes \mathcal H}_{N\ \text{copies}},$$



David Hilbert 1862-1943 

He contributed to many branches 
of mathematics, including invari- 
ants, algebraic number fields, 
functional analysis, integral equa- 
tions, mathematical physics, and 
the calculus of variations. His 
work in geometry had the greatest influence in that 
area after Euclid. A systematic study of the axioms 
of Euclidean geometry led Hilbert to propose twenty 
one such axioms and he analyzed their significance. 




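of $N$ single-particle Hilbert spaces. In this space, we have a linear representation of the
permutation group, $S^N$,$^3$ assigning to each $\mathcal P \in S^N$ the permutation (no ordering of the $\lambda$s implied at
this stage),

$$\mathcal P: \mathcal H^N \to \mathcal H^N, \qquad |\lambda_1\rangle \otimes \cdots \otimes |\lambda_N\rangle \mapsto |\lambda_{\mathcal P 1}\rangle \otimes \cdots \otimes |\lambda_{\mathcal P N}\rangle.$$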



3 Recall that a linear representation of a group $G$ is a mapping that assigns to each $g \in G$ a linear mapping
$\rho_g: V \to V$ of some vector space $V$. For notational convenience one usually writes $g: V \to V$ instead of
$\rho_g: V \to V$. Conceptually, however, it is often important to carefully discriminate between the abstract group
elements $g$ and the matrices (also $g$) assigned to them by a given representation. (Consider, for example, the
symmetry group $G = \mathrm{SU}(2)$ of quantum mechanical spin. $\mathrm{SU}(2)$ is the group of two-dimensional unitary matrices
with determinant one. However, when acting in the Hilbert space of a quantum spin $S = 5$, say, elements of
$\mathrm{SU}(2)$ are represented by $(2S + 1 = 11)$-dimensional matrices.) Two representations $\rho$ and $\rho'$ that differ only by
a unitary transformation, $\forall g \in G: \rho_g = U\rho'_g U^{-1}$, are called unitary equivalent. If a transformation $U$ can be
found such that all representation matrices $\rho_g$ assume a block structure, the representation is called reducible,
and otherwise irreducible. Notice that the different sub-blocks of a reducible representation by themselves form
irreducible representation spaces. The identification of all distinct irreducible representations of a given group
is one of the most important objectives of group theory.








The identification of all irreducible subspaces of this representation is a formidable task that,
thanks to a fundamental axiom of quantum mechanics, we need not address in full. All we need
to know is that $S^N$ has two particularly simple one-dimensional irreducible representations: one
wherein each $\mathcal P \in S^N$ acts as the identity transform $\mathcal P(\Psi) \equiv \Psi$ and, another, the alternating
representation $\mathcal P(\Psi) = \mathrm{sgn}\,\mathcal P \cdot \Psi$. According to a basic postulate of quantum mechanics,
the state vectors $\Psi \in \mathcal H^N$ describing bosons/fermions must transform according to the
identity/alternating representation. The subset $\mathcal F^N \subset \mathcal H^N$ of all states showing this transformation
behavior defines the physical $N$-body Hilbert space. To construct a basis of $\mathcal F^N$, one may apply
the symmetrization operator $P^s \equiv \sum_{\mathcal P} \mathcal P$ (anti-symmetrization operator $P^a \equiv \sum_{\mathcal P} (\mathrm{sgn}\,\mathcal P)\mathcal P$) to
the basis vectors $|\lambda_1\rangle \otimes \cdots \otimes |\lambda_N\rangle$ of $\mathcal H^N$. Up to normalization, this operation obtains the states
(2.1).

Some readers may wonder why we mention these representation-theoretic aspects. Being prag- 
matic, all we really need to know is the symmetrization/anti-symmetrization postulate, and its 
implementation through Eq. (2.1). Notice, however, that one may justly question what we actu- 
ally mean when we talk about the permutation exchange of quantum numbers. For example, 
when we compare wavefunctions that differ by an exchange of coordinates, we should, at least 
in principle, be able to tell by what physical operation we effect this exchange (for, otherwise, 
we cannot really compare them other than in a formal and, in fact, in an ambiguous sense). 

Oddly enough, decades passed before this crucial issue in quantum mechanics was critically 
addressed. In a now seminal work by Leinaas and Myrheim 4 it was shown that the standard 
paradigm of permutation exchange is far from innocent. In particular, it turned out that its appli- 
cability is tied to the dimensionality of space! Specifically, in two-dimensional spaces (in a sense, 
also in d = 1) a more elaborate scheme is needed. (Still one may use representation-theoretic 
concepts to describe particle exchange. However, the relevant group - the braid group - now 
differs from the permutation group.) Physically, these phenomena manifest themselves in the 
emergence of quantum particles different from both bosons and fermions. For a further discussion 
of these “anyons” we refer to Chapter 9. 

While representations like Eq. (2.1) can be used to represent the full Hilbert space of
many-body quantum mechanics, a moment’s thought shows that this formulation is not at all
convenient:

> It takes little imagination to anticipate that practical computation in the language of
Eq. (2.1) will be cumbersome. For example, to compute the overlap of two wavefunctions
one needs to form no less than $(N!)^2$ different products.

> The representation is tailor-made for problems with fixed particle number $N$. However,
we know from statistical mechanics that for $N = O(10^{23})$ it is much more convenient to
work in a grand canonical formulation where $N$ is allowed to fluctuate.

> Closely related to the above, in applications one will often ask questions such as, “What is
the amplitude for injection of a particle into the system at a certain space-time coordinate
$(x_1, t_1)$ followed by annihilation at some later time $(x_2, t_2)$?” Ideally, one would work
with a representation that supports the intuition afforded by thinking in terms of such


4 J. M. Leinaas and J. Myrheim, On the theory of identical particles, Il Nuovo Cimento B 37 (1977), 1-23.







processes: i.e. a representation where the quantum numbers of individual quasi-particles 
rather than the entangled set of quantum numbers of all constituents are fundamental. 

The “second quantized” formulation of many-body quantum mechanics, as introduced in
the next subsection, will remove all these difficulties in an elegant and efficient manner.



The apparatus of second quantization 

Occupation number representation and Fock space 

Some of the disadvantages of the representation (2.1) can be avoided with relatively little
effort. In our present notation, quantum states are represented by “$N$-letter words” of the
form $|1, 1, 1, 1, 2, 2, 3, 3, 3, 4, 6, 6, \ldots\rangle$. Obviously, this notation contains a lot of redundancy.
A more efficient encoding of the state above might read $|4, 2, 3, 1, 0, 2, \ldots\rangle$, where the $i$th
number signals how many particles occupy state number $i$; no more information is needed
to characterize a symmetrized state. (For fermions, these occupation numbers take a value
of either zero or one.) This defines the “occupation number representation.” In the
new representation, the basis states of $\mathcal F^N$ are specified by $|n_1, n_2, \ldots\rangle$, where $\sum_i n_i = N$.
Any state $|\Psi\rangle$ in $\mathcal F^N$ can be obtained by a linear superposition

$$|\Psi\rangle = \sum_{n_1, n_2, \ldots,\ \sum_i n_i = N} c_{n_1, n_2, \ldots}\, |n_1, n_2, \ldots\rangle.$$
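The re-encoding of a “word” into occupation numbers is a simple counting operation. The sketch below illustrates it for the example state quoted above; the function names and the convention that levels are labelled $1, 2, \ldots$ are assumptions made for illustration.

```python
from collections import Counter

def occupation_numbers(word, n_levels):
    """Map an ordered N-letter word (level labels 1, 2, ...) to (n_1, ..., n_{n_levels})."""
    counts = Counter(word)
    return [counts.get(level, 0) for level in range(1, n_levels + 1)]

def word_from_occupations(occupations):
    """Inverse map: rebuild the ordered word from the occupation numbers."""
    word = []
    for level, n in enumerate(occupations, start=1):
        word.extend([level] * n)
    return word

word = [1, 1, 1, 1, 2, 2, 3, 3, 3, 4, 6, 6]
occ = occupation_numbers(word, n_levels=6)
print(occ)       # [4, 2, 3, 1, 0, 2]
print(sum(occ))  # total particle number N = 12
```

The round trip `word_from_occupations(occupation_numbers(word, 6))` returns the original ordered word, which reflects the statement that no further information is needed to characterize a symmetrized state.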

As pointed out above, eventually we will want to emancipate ourselves from the condition
of a fixed particle number $N$. A Hilbert space large enough to accommodate a state with
an undetermined number of particles is given by

$$\mathcal F \equiv \bigoplus_{N=0}^{\infty} \mathcal F^N. \qquad (2.2)$$

Notice that the direct sum contains a curious contribution $\mathcal F^0$, the “vacuum space.” This is a
one-dimensional Hilbert space which describes the sector of the theory with no particles
present. Its single normalized basis state, the vacuum state, is denoted by $|0\rangle$. We will
soon see why it is convenient to add this strange animal to our family of basis states. The
space $\mathcal F$ is called Fock space and it defines the principal arena of quantum many-body theory.

To obtain a basis of $\mathcal F$, we need only take the totality of our previous basis states
$\{|n_1, n_2, \ldots\rangle\}$, and drop the condition $\sum_i n_i = N$ on the occupation numbers. A general
many-body state $|\Psi\rangle$ can then be represented by a linear superposition $|\Psi\rangle =$



Vladimir Aleksandrovich Fock 
1898-1974 

One of the main participants in 
the history of the general theory 
of relativity in Russia. His ground- 
breaking contributions to many- 
body theory include the introduc- 
tion of Fock space and the development of per- 
haps the most important many-particle approxima- 
tion scheme, the Hartree-Fock approximation (see 
Chapter 5). 








$\sum_{n_1, n_2, \ldots} c_{n_1, n_2, \ldots}\, |n_1, n_2, \ldots\rangle$. Notice that states of different particle numbers may
contribute to the linear superposition forming $|\Psi\rangle$. We shall see that such mixtures play an
important role, for example in the theory of superconductivity.

Foundations of second quantization 

The occupation number representation introduced above provides a step in the right
direction, but it does not yet solve our main problem, the need for explicit
symmetrization/anti-symmetrization of a large number of states in each quantum operation.

As a first step towards the construction of a more powerful representation, let us recall
an elementary fact of linear algebra: a linear map $A: V \to V$ of a vector space into itself is
fully determined by defining the images $w_i \equiv Av_i$ of the action of $A$ on a basis $\{v_i\}$. Now
let us use this scheme to introduce a set of linear operators acting in Fock space. For every
$i = 1, 2, \ldots$, we define operators $a_i^\dagger: \mathcal F \to \mathcal F$ through

$$a_i^\dagger|n_1, \ldots, n_i, \ldots\rangle \equiv (n_i + 1)^{1/2}\,\zeta^{s_i}\,|n_1, \ldots, n_i + 1, \ldots\rangle, \qquad (2.3)$$

where $s_i = \sum_{j=1}^{i-1} n_j$. In the fermionic case, the occupation numbers $n_i$ have to be understood
mod 2. Specifically, $(1 + 1) = 0$ mod 2, i.e. the application of $a_i^\dagger$ to a state with $n_i = 1$
annihilates this state.

Notice that by virtue of this definition we are able to generate every basis state of $\mathcal F$ by
repeated application of $a_i^\dagger$s to the vacuum state. (From a formal point of view, this fact
alone is motivation enough to add the vacuum space to the definition of Fock space.) Indeed,
repeated application of Eq. (2.3) leads to the important relation

$$|n_1, n_2, \ldots\rangle = \prod_i \frac{1}{(n_i!)^{1/2}}\,(a_i^\dagger)^{n_i}\,|0\rangle. \qquad (2.4)$$

Notice that Eq. (2.4) presents a strong statement: the complicated permutation
“entanglement” implied in the definition (2.1) of the Fock states can be generated by straightforward
application of a set of linear operators to a single reference state. Physically, $N$-fold
application of operators $a^\dagger$ to the empty vacuum state generates an $N$-particle state, which is why
the $a^\dagger$s are commonly called creation operators. Of course, the introduction of creation
operators might still turn out to be useless, i.e. consistency with the properties of the Fock
states (such as the fact that, in the fermionic case, the numbers $n_i = 0, 1$ are defined only
mod 2), might invalidate the simple relation (2.3) with its ($n_i$-independent!) operator $a_i^\dagger$.
However, as we shall demonstrate below, this is not the case.

Consider two operators $a_i^\dagger$ and $a_j^\dagger$ for $i \neq j$. From the definition (2.3), one may readily
verify that $(a_i^\dagger a_j^\dagger - \zeta a_j^\dagger a_i^\dagger)|n_1, n_2, \ldots\rangle = 0$. Holding for every basis vector, this relation implies
that $[a_i^\dagger, a_j^\dagger]_\zeta = 0$, where

$$[\hat A, \hat B]_\zeta \equiv \hat A\hat B - \zeta\hat B\hat A,$$

i.e. $[\,,]_{\zeta=1} \equiv [\,,]$ is the commutator and $[\,,]_{\zeta=-1} \equiv \{\,,\} \equiv [\,,]_+$ the anti-commutator.
Turning to the case $i = j$, we note that, for fermions, the two-fold application of $a_i^\dagger$ to any
state leads to its annihilation. Thus, $a_i^{\dagger 2} = 0$ is nilpotent, a fact that can be formulated as









Figure 2.1 Visualization of the generation of the Fock subspaces $\mathcal F^N$ by repeated action of creation
operators on the vacuum space $\mathcal F^0$.

$[a_i^\dagger, a_i^\dagger]_+ = 0$. For bosons we have, of course, $[a_i^\dagger, a_i^\dagger] = 0$ (identical operators commute!).
Summarizing, we have found that the creation operators obey the commutation relation

$$\forall i, j: \quad [a_i^\dagger, a_j^\dagger]_\zeta = 0. \qquad (2.5)$$

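Now, quantum mechanics is a unitary theory so, whenever one meets a new operator $\hat A$,
one should determine its Hermitian adjoint $\hat A^\dagger$. To understand the action of the Hermitian
adjoints $(a_i^\dagger)^\dagger = a_i$ of the creation operators we may take the complex conjugates of all
basis matrix elements of Eq. (2.3):

$$\langle n_1, \ldots, n_i, \ldots|a_i^\dagger|n_1', \ldots, n_i', \ldots\rangle = (n_i' + 1)^{1/2}\,\zeta^{s_i'}\,\delta_{n_1, n_1'} \cdots \delta_{n_i, n_i' + 1} \cdots$$
$$\Rightarrow \quad \langle n_1', \ldots, n_i', \ldots|a_i|n_1, \ldots, n_i, \ldots\rangle = n_i^{1/2}\,\zeta^{s_i}\,\delta_{n_1', n_1} \cdots \delta_{n_i', n_i - 1} \cdots$$

Holding for every bra $\langle n_1', \ldots, n_i', \ldots|$, the last line tells us that

$$a_i|n_1, \ldots, n_i, \ldots\rangle = n_i^{1/2}\,\zeta^{s_i}\,|n_1, \ldots, n_i - 1, \ldots\rangle, \qquad (2.6)$$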

a relation that identifies $a_i$ as an operator that “annihilates” particles. The action of creation
and annihilation operators in Fock space is illustrated in Fig. 2.1. Creation operators
$a^\dagger: \mathcal F^N \to \mathcal F^{N+1}$ increase the particle number by one, while annihilation operators
$a: \mathcal F^N \to \mathcal F^{N-1}$ lower it by one; the application of an annihilation operator to the vacuum
state, $a_i|0\rangle = 0$, annihilates it. (Do not confuse the vector $|0\rangle$ with the number zero.)

Taking the Hermitian adjoint of Eq. (2.5) we obtain $[a_i, a_j]_\zeta = 0$. Further, a straightforward
calculation based on the definitions (2.3) and (2.6) shows that $[a_i, a_j^\dagger]_\zeta = \delta_{ij}$.
Altogether, we have shown that the creation and annihilation operators satisfy the algebraic
closure relation

$$[a_i, a_j^\dagger]_\zeta = \delta_{ij}, \qquad [a_i, a_j]_\zeta = 0, \qquad [a_i^\dagger, a_j^\dagger]_\zeta = 0. \qquad (2.7)$$

Given that the full complexity of Fock space is generated by application of $a_i^\dagger$s to a single
reference state, the simplicity of the relations obeyed by these operators seems remarkable
and surprising.
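The definitions (2.3) and (2.6) are simple enough to be implemented directly on occupation-number tuples, which gives a quick way of checking the algebra on individual basis states. The following is a minimal sketch under that assumption; the function names, the dictionary representation of superpositions, and the zero-based mode index are illustrative choices.

```python
from math import factorial, sqrt

def create(i, state, zeta):
    """a_i^dagger on a basis state (tuple of occupations), following Eq. (2.3).

    Returns (amplitude, new_state), or None if the state is annihilated.
    """
    n = state[i]
    if zeta == -1 and n == 1:  # fermions: (1 + 1) = 0 mod 2
        return None
    sign = zeta ** sum(state[:i])  # zeta^{s_i}, s_i = sum of n_j with j < i
    return sign * sqrt(n + 1), state[:i] + (n + 1,) + state[i + 1:]

def annihilate(i, state, zeta):
    """a_i on a basis state, following Eq. (2.6)."""
    n = state[i]
    if n == 0:
        return None
    sign = zeta ** sum(state[:i])
    return sign * sqrt(n), state[:i] + (n - 1,) + state[i + 1:]

def apply(op, i, vec, zeta):
    """Apply op (create or annihilate) to a superposition {state: amplitude}."""
    out = {}
    for state, amp in vec.items():
        result = op(i, state, zeta)
        if result is not None:
            coeff, new_state = result
            out[new_state] = out.get(new_state, 0.0) + amp * coeff
    return out

def zeta_commutator(i, j, state, zeta):
    """[a_i, a_j^dagger]_zeta = a_i a_j^dagger - zeta a_j^dagger a_i on a basis state."""
    vec = {state: 1.0}
    out = apply(annihilate, i, apply(create, j, vec, zeta), zeta)
    for s, c in apply(create, j, apply(annihilate, i, vec, zeta), zeta).items():
        out[s] = out.get(s, 0.0) - zeta * c
    return {s: c for s, c in out.items() if abs(c) > 1e-12}

def from_vacuum(occupations, zeta):
    """Right-hand side of Eq. (2.4): prod_i (a_i^dagger)^{n_i} / sqrt(n_i!) |0>."""
    vec = {(0,) * len(occupations): 1.0}
    for i in reversed(range(len(occupations))):  # rightmost operators act first
        for _ in range(occupations[i]):
            vec = apply(create, i, vec, zeta)
        vec = {s: c / sqrt(factorial(occupations[i])) for s, c in vec.items()}
    return vec

print(zeta_commutator(0, 0, (2, 0, 3), zeta=1))   # bosons, i = j: returns the state itself
print(zeta_commutator(0, 2, (1, 0, 0), zeta=-1))  # fermions, i != j: empty dict (zero)
print(from_vacuum((2, 0, 3), zeta=1))             # amplitude ~1 on (2, 0, 3)
```

Because the operators are applied state by state, no truncation of the bosonic occupation numbers is needed; the price is that only selected basis states are checked rather than the operator identity as a whole.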

Perhaps less surprising is that, behind this phenomenon, there lingers some mathematical
structure. Suppose we are given an abstract algebra $\mathcal A$ of objects $a_i, a_i^\dagger$ satisfying the relation
(2.7). (Recall that an algebra is a vector space whose elements can be multiplied by each other.)
Further suppose that $\mathcal A$ is irreducibly represented in some vector space $V$, i.e. that there is a
mapping assigning to each $a_i \in \mathcal A$ a linear mapping $a_i: V \to V$, such that every vector $|v\rangle \in V$
can be reached from any other $|w\rangle \in V$ by (possibly iterated) application of operators $a_i$ and $a_i^\dagger$









(irreducibility).$^5$ According to the Stone-von Neumann theorem (a) such a representation
is unique (up to unitary equivalence), and (b) there is a unique state $|0\rangle \in V$ that is
annihilated by every $a_i$. All other states can then be reached by repeated application of $a_i^\dagger$s. The
precise formulation of this theorem, and its proof - a good practical exercise in working with
creation/annihilation operators - are left as Problem 2.4. From the Stone-von Neumann
theorem, we can infer that the Fock space basis could have been constructed in reverse. Not knowing
the basis $\{|n_1, n_2, \ldots\rangle\}$, we could have started from a set of operators obeying the commutation
relations (2.7) acting in some a priori unknown space $\mathcal F$. Starting from the unique state $|0\rangle$,
the prescription (2.4) would then have yielded an equally unique basis of the entire space $\mathcal F$ (up
to unitary transformations). In other words, the algebra (2.7) fully characterizes the operator
action and provides information equivalent to the definitions (2.3) and (2.6).



Practical aspects 

Our next task will be to promote the characterization of Fock space bases introduced above
to a full reformulation of many-body quantum mechanics. To this end, we need to find
out how changes from one single-particle basis $\{|\lambda\rangle\}$ to another $\{|\tilde\lambda\rangle\}$ affect the operator
algebra $\{a_\lambda\}$. (In this section we shall no longer use integers to identify different elements
of a given single-particle basis. Rather, we use Greek labels $\lambda$, i.e. $a_\lambda^\dagger$ creates a particle in
state $\lambda$.) Equally important, we need to understand in what way generic operators acting
in many-particle Hilbert spaces can be represented in terms of creation and annihilation
operators.

> Change of basis: Using the resolution of identity $\mathrm{id} = \sum_{\lambda=0}^{\infty} |\lambda\rangle\langle\lambda|$, the relations
$|\tilde\lambda\rangle = \sum_\lambda |\lambda\rangle\langle\lambda|\tilde\lambda\rangle$, $|\lambda\rangle = a_\lambda^\dagger|0\rangle$, and $|\tilde\lambda\rangle = a_{\tilde\lambda}^\dagger|0\rangle$ immediately give rise to the transformation law

$$a_{\tilde\lambda}^\dagger = \sum_\lambda \langle\lambda|\tilde\lambda\rangle\, a_\lambda^\dagger, \qquad a_{\tilde\lambda} = \sum_\lambda \langle\tilde\lambda|\lambda\rangle\, a_\lambda. \qquad (2.8)$$

In many applications, we will be dealing with continuous sets of quantum numbers (such
as position coordinates). In these cases, the quantum numbers are commonly denoted
by a bracket notation $a_\lambda \to a(x) \equiv \sum_\lambda \langle x|\lambda\rangle a_\lambda$ and summations appearing in the
transformation formula above translate to integrals: $a_\lambda = \int dx\, \langle\lambda|x\rangle a(x)$.



EXERCISE The transformation from the coordinate to the Fourier momentum representation
in a finite one-dimensional system of length $L$ would read

$$a_k = \int_0^L dx\, \langle k|x\rangle\, a(x), \qquad a(x) = \sum_k \langle x|k\rangle\, a_k, \qquad (2.9)$$

where $\langle k|x\rangle \equiv \langle x|k\rangle^* = e^{-ikx}/\sqrt L$.
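The transformation law can be illustrated on a lattice, where the overlaps $\langle k|x\rangle$ form a finite unitary matrix. The sketch below assumes a discretization of the interval $[0, L)$ into $L$ sites of unit spacing, so that the integral in Eq. (2.9) becomes a sum; this discretization and the function name `fourier_matrix` are assumptions of the example. Since the transformation is linear, a vector of complex coefficients stands in for the operators $a(x)$.

```python
import numpy as np

def fourier_matrix(L):
    """U[k_index, x] = <k|x> = exp(-i k x) / sqrt(L) with k = 2 pi m / L, x = 0..L-1."""
    x = np.arange(L)
    k = 2.0 * np.pi * np.arange(L) / L
    return np.exp(-1j * np.outer(k, x)) / np.sqrt(L)

L = 8
U = fourier_matrix(L)

# Unitarity of the overlap matrix guarantees that the relations (2.7) are preserved.
print(np.allclose(U @ U.conj().T, np.eye(L)))  # True

# Round trip a(x) -> a_k -> a(x).
rng = np.random.default_rng(0)
a_x = rng.normal(size=L) + 1j * rng.normal(size=L)
a_k = U @ a_x            # a_k = sum_x <k|x> a(x)
back = U.conj().T @ a_k  # a(x) = sum_k <x|k> a_k
print(np.allclose(back, a_x))  # True
```

Unitarity is the essential property here: for any unitary change of basis, $[a_{\tilde\lambda}, a_{\tilde\lambda'}^\dagger]_\zeta = \sum_\lambda \langle\tilde\lambda|\lambda\rangle\langle\lambda|\tilde\lambda'\rangle = \delta_{\tilde\lambda\tilde\lambda'}$.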



5 To appropriately characterize the representation, we need to be a bit more precise. Within $\mathcal A$, $a_i$ and $a_i^\dagger$ are
independent objects, i.e. in general there exists no notion of Hermitian adjointness in $\mathcal A$. We require, though,
that the representation assigns to $a_i^\dagger$ the Hermitian adjoint (in $V$) of the image of $a_i$. Also, we have to require
that $[a_i, a_j^\dagger] \in \mathcal A$ be mapped onto $[a_i, a_j^\dagger]: V \to V$ where, in the latter expression, the commutator involves the
ordinary product of matrices $a_i, a_j^\dagger: V \to V$.

> Representation of operators (one-body): Single-particle or one-body operators $\hat O_1$
acting in the $N$-particle Hilbert space $\mathcal F^N$ generally take the form $\hat O_1 = \sum_{n=1}^{N} \hat o_n$, where
$\hat o_n$ is an ordinary single-particle operator acting on the $n$th particle. A typical example
is the kinetic energy operator $\hat T = \sum_n \hat p_n^2/2m$, where $\hat p_n$ is the momentum operator
acting on the $n$th particle. Other examples include the one-particle potential operator
$\hat V = \sum_n V(\hat x_n)$, where $V(x)$ is a scalar potential, the total spin operator $\sum_n \hat{\mathbf S}_n$, etc. Since
we have seen that, by applying field operators to the vacuum space, we can generate the
Fock space in general and any $N$-particle Hilbert space in particular, it must be possible
to represent any operator $\hat O_1$ in an $a$-representation.

Now, although the representation of $n$-body operators is, after all, quite straightforward,
the construction can, at first sight, seem daunting. A convenient way of finding such
a representation is to express the operator in terms of a basis in which it is diagonal,
and only later transform to an arbitrary basis. For this purpose it is useful to define the
occupation number operator



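$$\hat n_\lambda = a_\lambda^\dagger a_\lambda \qquad (2.10)$$

with the property that, for bosons or fermions (exercise), $\hat n_\lambda (a_\lambda^\dagger)^n|0\rangle = n\,(a_\lambda^\dagger)^n|0\rangle$.
Since $\hat n_\lambda$ commutes with all $a_{\lambda' \neq \lambda}^\dagger$, Eq. (2.4) readily implies that
$\hat n_{\lambda_j}|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle = n_{\lambda_j}|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle$, i.e., $\hat n_\lambda$ simply counts the number of particles in state $\lambda$ (hence the name
“occupation number operator”). Let us now consider a one-body operator, $\hat O_1$, which is
diagonal in the basis $|\lambda\rangle$, with $\hat o = \sum_i o_{\lambda_i}|\lambda_i\rangle\langle\lambda_i|$, $o_{\lambda_i} = \langle\lambda_i|\hat o|\lambda_i\rangle$. With this definition,
one finds that

$$\langle n'_{\lambda_1}, n'_{\lambda_2}, \ldots|\hat O_1|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle = \sum_i o_{\lambda_i} n_{\lambda_i}\, \langle n'_{\lambda_1}, n'_{\lambda_2}, \ldots|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle$$
$$= \langle n'_{\lambda_1}, n'_{\lambda_2}, \ldots|\sum_i o_{\lambda_i}\hat n_{\lambda_i}|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle.$$

Since this equality holds for any set of states, one can infer the second quantized
representation of the operator $\hat O_1$,

$$\hat O_1 = \sum_{\lambda=0}^{\infty} o_\lambda \hat n_\lambda = \sum_{\lambda=0}^{\infty} \langle\lambda|\hat o|\lambda\rangle\, a_\lambda^\dagger a_\lambda.$$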

A=0 A=0 

The result is straightforward: a one-body operator engages a single particle at a time - the
others are just spectators. In the diagonal representation, one simply counts the number
of particles in a state $\lambda$ and multiplies by the corresponding eigenvalue of the one-body
operator. Finally, by transforming from the diagonal representation to a general basis,
one obtains the general result,

$$\hat O_1 = \sum_{\mu\nu} \langle\mu|\hat o|\nu\rangle\, a_\mu^\dagger a_\nu. \qquad (2.11)$$









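To cement these ideas, let us consider some specific examples: representing the matrix
elements of the single-particle spin operator as $(\hat S_i)_{\alpha\alpha'} = \frac{1}{2}(\sigma_i)_{\alpha\alpha'}$, where $\alpha, \alpha'$ is a
two-component spin index and $\sigma_i$ are the Pauli spin matrices

$$\sigma_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \sigma_2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \sigma_3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad (2.12)$$

the spin operator of a many-body system assumes the form

$$\hat{\mathbf S} = \sum_\lambda a_{\lambda\alpha'}^\dagger\, \mathbf S_{\alpha'\alpha}\, a_{\lambda\alpha}. \qquad (2.13)$$

(Here, $\lambda$ denotes the set of additional quantum numbers, e.g. a lattice site index.) When
second quantized in the position representation, one can show that the one-body Hamiltonian
for a free particle is given as a sum of kinetic and potential energy as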



Wolfgang Ernst Pauli 1900-58 

Nobel Laureate in Physics in 1945 
“for the discovery of the Exclusion 
Principle, also called the Pauli 
Principle." The exclusion princi- 
ple crystallized the existing knowl- 
edge of atomic structure at the 
time it was postulated and it led 
to the recognition of the two-valued variable required 
to characterize the state of an electron. Pauli was 
the first to recognize the existence of the neutrino. 
(Image © The Nobel Foundation.) 



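$$\hat H = \int d^dr\, a^\dagger(\mathbf r)\left[\frac{\hat{\mathbf p}^2}{2m} + V(\mathbf r)\right] a(\mathbf r), \qquad (2.14)$$

where $\hat{\mathbf p} = -i\hbar\partial$.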



EXERCISE Starting with momentum representation (in which the kinetic energy is diagonal),
transform to the position representation and thereby establish Eq. (2.14).

The local density operator $\hat\rho(\mathbf r)$, measuring the particle density at a certain coordinate
$\mathbf r$, is simply given by

$$\hat\rho(\mathbf r) = a^\dagger(\mathbf r)\, a(\mathbf r). \qquad (2.15)$$

Finally, the total occupation number operator, obtained by integrating over the particle
density, is defined by $\hat N = \int d^dr\, a^\dagger(\mathbf r)a(\mathbf r)$. In a theory with discrete quantum numbers, this
operator assumes the form $\hat N = \sum_\lambda a_\lambda^\dagger a_\lambda$.
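The one-body representation (2.11) can be tested on a small fermionic Fock space by writing the operators as explicit matrices. The sketch below assumes $M = 3$ single-particle states and uses a tensor-product (Jordan-Wigner type) construction in which the sign factor $\zeta^{s_i}$ of Eq. (2.3) is produced by diagonal matrices acting on the modes $j < i$; the function names and the random test operator are illustrative.

```python
import numpy as np

def fermion_annihilators(M):
    """Annihilation operators a_0..a_{M-1} as 2^M x 2^M matrices.

    Each mode has basis (|0>, |1>); Z = diag(1, -1) on the modes j < i
    supplies the sign zeta^{s_i} of Eq. (2.3).
    """
    a = np.array([[0.0, 1.0], [0.0, 0.0]])  # maps |1> to |0>
    Z = np.diag([1.0, -1.0])
    I2 = np.eye(2)
    ops = []
    for i in range(M):
        op = np.array([[1.0]])
        for factor in [Z] * i + [a] + [I2] * (M - i - 1):
            op = np.kron(op, factor)
        ops.append(op)
    return ops

def one_body_operator(o, a):
    """O_1 = sum_{mu,nu} <mu|o|nu> a_mu^dagger a_nu, Eq. (2.11)."""
    dim = a[0].shape[0]
    O = np.zeros((dim, dim), dtype=complex)
    for mu in range(len(a)):
        for nu in range(len(a)):
            O += o[mu, nu] * (a[mu].conj().T @ a[nu])
    return O

M = 3
a = fermion_annihilators(M)
rng = np.random.default_rng(0)
h = rng.normal(size=(M, M)) + 1j * rng.normal(size=(M, M))
o = h + h.conj().T  # arbitrary Hermitian single-particle operator
O1 = one_body_operator(o, a)

# Restricted to one-particle states |nu> = a_nu^dagger |0>, O_1 reproduces o.
vac = np.zeros(2**M)
vac[0] = 1.0
single = [a[nu].T @ vac for nu in range(M)]
restricted = np.array([[single[mu] @ O1 @ single[nu] for nu in range(M)] for mu in range(M)])
print(np.allclose(restricted, o))  # True

# In the eigenbasis of o, O_1 = sum_lambda o_lambda n_lambda.
evals, U = np.linalg.eigh(o)
a_tilde = [sum(np.conj(U[nu, lam]) * a[nu] for nu in range(M)) for lam in range(M)]
O_diag = sum(evals[lam] * (a_tilde[lam].conj().T @ a_tilde[lam]) for lam in range(M))
print(np.allclose(O_diag, O1))  # True
```

The second check mirrors the construction in the text: the operator is first written in its diagonal basis as a weighted sum of occupation number operators and then transformed to a general basis with Eq. (2.8).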



> Representation of operators (two-body): Two-body operators O 2 are needed to 
describe pairwise interactions between particles. Although pair-interaction potentials are 
straightforwardly included in classical many-body theories, their embedding into conven- 
tional many-body quantum mechanics is made cumbersome by particle indistinguishabil- 
ity. The formulation of interaction processes within the language of second quantization 
is considerably more straightforward: 

Initially, let us consider particles subject to the symmetric two-body potential
$V(\mathbf r_m, \mathbf r_n) \equiv V(\mathbf r_n, \mathbf r_m)$ between two particles at position $\mathbf r_m$ and $\mathbf r_n$. Our aim is to
find an operator $\hat V$ in second quantized form whose action on a many-body state gives








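(presently, it is more convenient to use the original representation Eq. (2.1) rather than
the occupation number representation)

$$\hat V\,|\mathbf r_1, \mathbf r_2, \ldots, \mathbf r_N\rangle = \sum_{n<m}^{N} V(\mathbf r_n, \mathbf r_m)\,|\mathbf r_1, \mathbf r_2, \ldots, \mathbf r_N\rangle = \frac12 \sum_{n\neq m}^{N} V(\mathbf r_n, \mathbf r_m)\,|\mathbf r_1, \mathbf r_2, \ldots, \mathbf r_N\rangle.$$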

When this is compared with the one-point function, one might immediately guess that 

$$\hat V = \frac12 \int d^d r \int d^d r'\, a^\dagger(\mathbf r)\, a^\dagger(\mathbf r')\, V(\mathbf r, \mathbf r')\, a(\mathbf r')\, a(\mathbf r).$$

That this is the correct answer can be confirmed by applying the operator to a many-body 
state: 



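$$\begin{aligned}
a^\dagger(\mathbf r)\, a^\dagger(\mathbf r')\, a(\mathbf r')\, a(\mathbf r)\,|\mathbf r_1, \mathbf r_2, \ldots, \mathbf r_N\rangle
&= a^\dagger(\mathbf r)\, a^\dagger(\mathbf r')\, a(\mathbf r')\, a(\mathbf r)\, a^\dagger(\mathbf r_1) \cdots a^\dagger(\mathbf r_N)\,|0\rangle \\
&= \sum_{n=1}^{N} \zeta^{n-1}\, \delta(\mathbf r - \mathbf r_n)\, a^\dagger(\mathbf r_n)\, a^\dagger(\mathbf r')\, a(\mathbf r')\, a^\dagger(\mathbf r_1) \cdots a^\dagger(\mathbf r_{n-1})\, a^\dagger(\mathbf r_{n+1}) \cdots a^\dagger(\mathbf r_N)\,|0\rangle \\
&= \sum_{n=1}^{N} \zeta^{n-1}\, \delta(\mathbf r - \mathbf r_n) \sum_{m(\neq n)}^{N} \delta(\mathbf r' - \mathbf r_m)\, a^\dagger(\mathbf r_n)\, a^\dagger(\mathbf r_1) \cdots a^\dagger(\mathbf r_{n-1})\, a^\dagger(\mathbf r_{n+1}) \cdots a^\dagger(\mathbf r_N)\,|0\rangle \\
&= \sum_{n, m\neq n}^{N} \delta(\mathbf r - \mathbf r_n)\, \delta(\mathbf r' - \mathbf r_m)\,|\mathbf r_1, \mathbf r_2, \ldots, \mathbf r_N\rangle.
\end{aligned}$$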



Multiplying by $V(\mathbf r, \mathbf r')/2$, and integrating over $\mathbf r$ and $\mathbf r'$, one confirms the validity
of the expression. It is left as an exercise to confirm that the naive expression
$\frac12 \int d^d r \int d^d r'\, V(\mathbf r, \mathbf r')\, \hat\rho(\mathbf r)\, \hat\rho(\mathbf r')$ does not reproduce the two-body operator. More generally,
turning to a non-diagonal basis, it is straightforward to confirm that a general two-body
operator can be expressed in the form



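$$\hat O_2 = \sum_{\lambda\lambda'\mu\mu'} O_{\mu,\mu',\lambda,\lambda'}\, a^\dagger_{\mu'}\, a^\dagger_{\mu}\, a_{\lambda}\, a_{\lambda'}, \qquad (2.16)$$

where $O_{\mu,\mu',\lambda,\lambda'} \equiv \langle \mu, \mu' | \hat O_2 | \lambda, \lambda' \rangle$.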
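
As a hedged sketch of the exercise mentioned above (our own short derivation, not taken from the text), one can see why the naive density-density expression fails by commuting the inner pair of operators. We assume the relation $a(\mathbf r)\, a^\dagger(\mathbf r') - \zeta\, a^\dagger(\mathbf r')\, a(\mathbf r) = \delta(\mathbf r - \mathbf r')$, with $\zeta = \pm 1$ for bosons and fermions, as the convention of the preceding sections:

$$\begin{aligned}
\hat\rho(\mathbf r)\,\hat\rho(\mathbf r')
&= a^\dagger(\mathbf r)\, a(\mathbf r)\, a^\dagger(\mathbf r')\, a(\mathbf r') \\
&= a^\dagger(\mathbf r)\left[\delta(\mathbf r - \mathbf r') + \zeta\, a^\dagger(\mathbf r')\, a(\mathbf r)\right] a(\mathbf r') \\
&= \delta(\mathbf r - \mathbf r')\, \hat\rho(\mathbf r) + a^\dagger(\mathbf r)\, a^\dagger(\mathbf r')\, a(\mathbf r')\, a(\mathbf r),
\end{aligned}$$

where the last step uses $\zeta^2 = 1$. Integrating against $V(\mathbf r, \mathbf r')/2$ then gives

$$\frac12 \int d^d r \int d^d r'\, V(\mathbf r, \mathbf r')\, \hat\rho(\mathbf r)\, \hat\rho(\mathbf r') = \hat V + \frac12 \int d^d r\, V(\mathbf r, \mathbf r)\, \hat\rho(\mathbf r),$$

so the naive form differs from $\hat V$ by a one-body term in which each particle interacts with itself.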

As well as the pairwise Coulomb interaction formulated above, another important
interaction, frequently encountered in problems of quantum magnetism, is the spin-spin
interaction. From our discussion of the second-quantized representation of spin $\hat{\mathbf S}$ above,
we can infer that the general spin-spin interaction can be presented in second-quantized
form as

$$\hat V = \frac12 \int d^d r \int d^d r' \sum_{\alpha\alpha'\beta\beta'} J(\mathbf r, \mathbf r')\, \mathbf S_{\alpha\beta} \cdot \mathbf S_{\alpha'\beta'}\, a^\dagger_{\alpha}(\mathbf r)\, a^\dagger_{\alpha'}(\mathbf r')\, a_{\beta'}(\mathbf r')\, a_{\beta}(\mathbf r),$$

where $J(\mathbf r, \mathbf r')$ denotes the exchange interaction.








In principle, one may proceed in the same manner and represent general $n$-body interactions
in terms of second-quantized operators. However, as $n > 2$ interactions appear infrequently,
we refer to the literature for discussion. 

This completes our formal introduction to the method of second quantization. To develop 
fluency in the operation of the method, we will continue by addressing a number of problems 
chosen from the realm of condensed matter. In doing so, we will see that second quantization 
often leads to considerable simplification of the analysis of many-particle systems. The effec- 
tive model Hamiltonians that appear below provide the input for subsequent applications 
considered in this text. Readers not wishing to get distracted from our main focus - the
development of modern methods of quantum field theory in the condensed matter setting -
may safely skip the next sections and turn directly to Chapter 3 below. It is worthwhile
keeping in mind, however, that the physical motivation for the study of various prototypical 
model systems considered later in the text is given in Section 2.2. 



2.2 Applications of second quantization 

Starting from the prototype Hamiltonian (1.1) introduced in Chapter 1, we have already 
explored generic aspects of lattice dynamics in condensed matter systems. In much of the 
remaining text we will explore examples from the complementary sector focusing on the 
electronic degrees of freedom. Drawing on the first of the principles discussed in Chapter 1, 
we will begin our discussion by reducing the full Hamiltonian to a form that contains the 
essential elements of the electron dynamics. As well as the pure electron sub-Hamiltonian
$\hat H_e$, the reduced Hamiltonian will involve the interaction between the electrons and the
positively charged ionic background lattice. However, typically, lattice distortions due to
both the motion of the ions and the ion-ion interaction couple only indirectly. (Exercise:
Try to think of a prominent example where the electron sector is crucially influenced by
the dynamics of the host lattice.) To a first approximation, we may, therefore, describe the
electron system through the simplified Hamiltonian, $\hat H = \hat H_0 + \hat V_{ee}$, where



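$$\begin{aligned}
\hat H_0 &= \int d^d r\, a^\dagger_{\sigma}(\mathbf r)\left[\frac{\hat{\mathbf p}^2}{2m} + V(\mathbf r)\right] a_{\sigma}(\mathbf r), \\
\hat V_{ee} &= \frac12 \int d^d r \int d^d r'\, V_{ee}(\mathbf r - \mathbf r')\, a^\dagger_{\sigma}(\mathbf r)\, a^\dagger_{\sigma'}(\mathbf r')\, a_{\sigma'}(\mathbf r')\, a_{\sigma}(\mathbf r),
\end{aligned} \qquad (2.17)$$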



$V(\mathbf r) = \sum_I V_{ei}(\mathbf R_I - \mathbf r)$ denotes the lattice potential experienced by the electrons, and
the coordinates of the lattice ions $\mathbf R_I$ are assumed fixed. For completeness, we have also
endowed the electrons with a spin index, $\sigma = \uparrow/\downarrow$. The Hamiltonian defines the problem of
the interacting electron gas embedded in a solid state system.

Despite its seemingly innocuous structure, the interacting electron Hamiltonian (2.17) 
accommodates a wide variety of electron phases from metals and magnets to insulators. To 
classify the phase behavior of the model, it is helpful to divide our considerations, focusing 
first on the properties of the non-interacting single-particle system $\hat H_0$ and then, later,
restoring the electron interaction $\hat V_{ee}$.





Electrons in a periodic potential 






As we know from Bloch’s theorem, eigenstates of a periodic Hamiltonian can be presented
in the form of Bloch waves⁶ $\psi_{\mathbf k n}(\mathbf r) = e^{i\mathbf k\cdot\mathbf r}\, u_{\mathbf k n}(\mathbf r)$, where the components of the crystal
momentum $\mathbf k$ take values inside the Brillouin zone, $k_i \in [-\pi/a, \pi/a]$, and we have
assumed that the periodicity of the lattice potential is the same in all directions, i.e.
$V(\mathbf r + a\mathbf e_i) = V(\mathbf r)$. The index $n$ labels the separate energy bands of the solid, and the
functions $u_{\mathbf k n}(\mathbf r + a\mathbf e_i) = u_{\mathbf k n}(\mathbf r)$ are periodic on the lattice. Now, depending on the nature
of the bonding, there are two complementary classes of materials where the general
structure of the Bloch functions can be simplified significantly.



Felix Bloch 1905-83 

Nobel Laureate in Physics in 1952 “for the development [with Edward M. Purcell]
of new methods for nuclear magnetic precision measurements and discoveries in
connection therewith.” (Image © The Nobel Foundation.)




Nearly free electron systems 

For certain materials, notably the elemental metals drawn from groups I-IV of the periodic 
table, the outermost itinerant conduction electrons behave as if they were “nearly free,” i.e. 
their dynamics are largely oblivious to both the Coulomb potential created by the positively
charged ion background and their mutual interaction. 

INFO Loosely speaking, Pauli exclusion of the bound state inner core electrons prevents the 
conduction electrons from exploring the region close to the ion core, thereby effectively screening 
the nuclear charge. In practice, the conduction electrons experience a renormalized pseudopo- 
tential, which accommodates the effect of the lattice ions and core electrons. Moreover, the high 
mobility of the conduction electrons provides an efficient method of screening their own mutual 
Coulomb interaction. In nearly free electron compounds, complete neglect of the lattice potential 
is usually a good approximation (as long as one considers crystal momenta remote from the 
boundaries of the Brillouin zone, $k_i = \pm\pi/a$).

In practice, this means that we are at liberty to set the Bloch function to unity, $u_{\mathbf k n} = 1$,
and regard the eigenstates of the non-interacting Hamiltonian as plane waves. This in turn
motivates the representation of the field operators in momentum space Eq. (2.9), whereupon
the non-interacting part of the Hamiltonian assumes the free particle form (once again, we
have set $\hbar = 1$)

$$\hat H_0 = \sum_{\mathbf k} \frac{k^2}{2m}\, a^\dagger_{\mathbf k\sigma}\, a_{\mathbf k\sigma}, \qquad (2.18)$$



6 For a more detailed discussion one may refer to one of the many texts on the basic elements of solid state physics,
e.g., Ashcroft and Mermin, Solid State Physics (Holt-Saunders International, 1995).








where the sum extends over all wavevectors k and, as usual, summation convention of the 
spin indices is assumed. Turning to the Coulomb potential between conduction electrons, 
in the Fourier representation, the two-body interaction takes the form 

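$$\hat V_{ee} = \frac{1}{2L^d} \sum_{\mathbf k, \mathbf k', \mathbf q} V_{ee}(\mathbf q)\, a^\dagger_{\mathbf k - \mathbf q\,\sigma}\, a^\dagger_{\mathbf k' + \mathbf q\,\sigma'}\, a_{\mathbf k'\sigma'}\, a_{\mathbf k\sigma}, \qquad (2.19)$$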

where (choosing units such that $4\pi\epsilon_0 = 1$), $V_{ee}(\mathbf q) = 4\pi e^2/q^2$ represents the three-dimensional
Fourier transform of the Coulomb interaction potential $V_{ee}(\mathbf r) = e^2/|\mathbf r|$. Now, as written, this
expression neglects the fact that, in ionized solids, the negative charge density of the electron
cloud will be compensated by the charge density of the positively ionized background. The
latter can be incorporated into Eq. (2.19) by placing on the sum over $\mathbf q$ the restriction that
$\mathbf q \neq 0$ (exercise). Taken together, the free electron Hamiltonian $\hat H_0$ and the Coulomb
interaction potential $\hat V_{ee}$ are known as the Jellium model.

The interaction described by Eq. (2.19) can be 
illustrated graphically, as shown in the figure (for 
a more elaborate discussion of such diagrams, 
see Chapter 5): an electron of momentum $\mathbf k$ is
scattered into a new momentum state $\mathbf k - \mathbf q$ while
another electron is scattered from $\mathbf k' \to \mathbf k' + \mathbf q$.

In concrete applications of low- 
temperature condensed matter 
physics, one will typically consider 
low excitation energies. The solution 
of such problems is naturally orga- 
nized around the zero-temperature 
ground state as a reference platform. 

However, the accurate calculation of 
the ground state energy of the system is a complicated problem of many-body physics that 
cannot be solved in closed form. Therefore, assuming that interactions will not substantially 
alter the ground state of the free particle problem Eq. (2.18) - which is often not the case! - 
one usually uses the ground state of the latter as a reference state. 

INFO Deferring a more qualified discussion to later, a preliminary justification for this assumption
can be given as follows. Suppose that the density of the electron gas is such that each of
its $N$ constituent particles occupies an average volume of $O(a^d)$. The average kinetic energy
per particle is then estimated to be $T \sim 1/ma^2$, while the Coulomb interaction potential will
scale as $V \sim e^2/a$. Thus, for $a$ much smaller than the Bohr radius $a_0 = 1/e^2 m$, the interaction
contribution is much smaller than the average kinetic energy. In other words, for the dense
electron gas, the interaction energy can indeed be treated as a perturbation. Unfortunately, for
most metals one finds that $a \sim a_0$ and neither high- nor low-density approximations are strictly
justified.
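
As a one-line illustration of the scaling argument in this INFO block (a sketch that uses only the estimates quoted there, with $\hbar = 1$), the ratio of interaction to kinetic energy per particle is

$$\frac{V}{T} \sim \frac{e^2/a}{1/(m a^2)} = m e^2 a = \frac{a}{a_0},$$

so the single dimensionless parameter $a/a_0$ controls whether the interaction may be treated perturbatively.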



Niels Henrik David Bohr 1885- 
1962 

Recipient of the 1922 Nobel Prize 
in Physics “for his services in the 
investigation of the structure of 
atoms and of the radiation ema- 
nating from them.” (Image © 
The Nobel Foundation.) 






The ground state of the system occupied by $N$ non-interacting particles can be readily
inferred from Eq. (2.18). The Pauli principle implies that all energy states $\epsilon_{\mathbf k} = k^2/2m$
will be uniformly occupied up to a cutoff Fermi energy, $E_F$. Specifically, for a system of
size $L$, allowed momentum states $\mathbf k$ have components $k_i = 2\pi n_i/L$, $n_i \in \mathbb Z$. The summation
extends up to momenta with $|\mathbf k| \le k_F$, where the Fermi momentum $k_F$ is defined through
the relation $k_F^2/2m = E_F$ (see figure at right). The relation between the Fermi momentum
and the occupation number can be established by dividing the volume of the Fermi sphere
$\sim k_F^d$ by the momentum space volume per mode $(2\pi/L)^d$,
viz. $N = C (k_F L)^d$ where $C$ denotes a dimensionless geometry-dependent constant.
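
The counting argument above can be checked numerically. The following is a minimal sketch, assuming $d = 3$ and a spin degeneracy of 2, for which the continuum estimate gives $C = 1/(3\pi^2)$; the function names `count_states` and `continuum_estimate` and the values of `kF` and `L` are illustrative choices, not taken from the text.

```python
import numpy as np

def count_states(kF, L, d=3, spin=2):
    """Count states with |k| <= kF on the grid k_i = 2*pi*n_i/L (n_i integer)."""
    nmax = int(np.floor(kF * L / (2 * np.pi)))
    n = np.arange(-nmax, nmax + 1)
    grids = np.meshgrid(*([n] * d), indexing="ij")
    k2 = sum((2 * np.pi * g / L) ** 2 for g in grids)
    return spin * int(np.count_nonzero(k2 <= kF ** 2))

def continuum_estimate(kF, L, d=3, spin=2):
    """spin * (volume of Fermi sphere) / (2*pi/L)^d, for d = 1, 2 or 3."""
    unit_ball = {1: 2.0, 2: np.pi, 3: 4.0 * np.pi / 3.0}[d]
    return spin * unit_ball * kF ** d / (2 * np.pi / L) ** d

kF, L = 1.0, 200.0          # illustrative values in units where hbar = 1
N_exact = count_states(kF, L)
N_cont = continuum_estimate(kF, L)
print(N_exact, N_cont, N_exact / N_cont)
```

For large $k_F L$ the ratio of the two numbers should approach one, the residual difference coming from modes near the surface of the Fermi sphere.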

In the language of second quantization, the ground state can now be represented as 

$$|\Omega\rangle \equiv \prod_{|\mathbf k| \le k_F,\, \sigma} a^\dagger_{\mathbf k\sigma}\,|0\rangle, \qquad (2.20)$$




where $|0\rangle$ denotes the state with zero electrons present. When the interaction is weak,
one may anticipate that low-temperature properties will be governed by energetically low-lying
excitations superimposed upon the state $|\Omega\rangle$. Therefore, remembering the philosophy
whereby excitations rather than microscopic constituents play a prime role, one would like
to declare the filled Fermi sea, $|\Omega\rangle$ (rather than the empty state $|0\rangle$), to be the “physical
vacuum” of the theory. To make this compatible with the language of second quantization, we
need to identify a new operator algebra $c_{\mathbf k\sigma}$, $c^\dagger_{\mathbf k\sigma}$ such that the new operators $c_{\mathbf k\sigma}$ annihilate
the Fermi sea. This can be easily engineered by defining



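$$c^\dagger_{\mathbf k\sigma} = \begin{cases} a^\dagger_{\mathbf k\sigma}, & k > k_F, \\ a_{\mathbf k\sigma}, & k \le k_F, \end{cases}
\qquad
c_{\mathbf k\sigma} = \begin{cases} a_{\mathbf k\sigma}, & k > k_F, \\ a^\dagger_{\mathbf k\sigma}, & k \le k_F. \end{cases}
\qquad (2.21)$$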



It is then a straightforward matter to verify that $c_{\mathbf k\sigma}|\Omega\rangle = 0$, and that the canonical
commutation relations are preserved.
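
The two statements above can be verified explicitly on a small toy system. The sketch below is an illustration under stated assumptions: it uses five spinless modes with integer momentum labels, a Jordan-Wigner matrix representation of the fermion operators (our choice of representation, not part of the text), and hypothetical helper names `jordan_wigner` and `anticomm`.

```python
import numpy as np
from functools import reduce

def jordan_wigner(M):
    """Annihilation matrices a_0..a_{M-1} acting on the 2^M-dimensional Fock space."""
    I2 = np.eye(2)
    Z = np.diag([1.0, -1.0])                     # fermion parity of a single mode
    lower = np.array([[0.0, 1.0], [0.0, 0.0]])   # maps occupied |1> to empty |0>
    return [reduce(np.kron, [Z] * j + [lower] + [I2] * (M - j - 1)) for j in range(M)]

def anticomm(A, B):
    return A @ B + B @ A

ks = np.array([-2, -1, 0, 1, 2])   # toy momenta in units of 2*pi/L (assumption)
kF = 1                             # modes with |k| <= kF are filled
M = len(ks)
a = jordan_wigner(M)
adag = [x.T for x in a]            # matrices are real, so dagger = transpose

vac = np.zeros(2 ** M)
vac[0] = 1.0                       # empty state |0>
omega = vac.copy()
for j in range(M):
    if abs(ks[j]) <= kF:
        omega = adag[j] @ omega    # fill the Fermi sea, cf. Eq. (2.20)

# Eq. (2.21): particle operators above kF, hole operators at or below kF
c = [a[j] if abs(ks[j]) > kF else adag[j] for j in range(M)]
cdag = [x.T for x in c]

annihilates = all(np.allclose(cj @ omega, 0.0) for cj in c)
canonical = all(
    np.allclose(anticomm(c[i], cdag[j]), (i == j) * np.eye(2 ** M))
    and np.allclose(anticomm(c[i], c[j]), 0.0)
    for i in range(M) for j in range(M)
)
print(annihilates, canonical)
```

Since the particles are fermions, the relations checked here are the canonical anticommutation relations.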

The Hamiltonian defined through Eq. (2.18) and (2.19), represented in terms of the 
operator algebra (2.21) and the vacuum (2.20), forms the basis of the theory of interactions 
in highly mobile electron compounds. The investigation of the role of Coulomb interactions 
in such systems will provide a useful arena to apply the methods of quantum field theory 
formulated in subsequent chapters. Following our classification of electron systems, let us 
now turn our attention to a complementary class of materials where the lattice potential 
presents a strong perturbation on the conduction electrons. In such situations, realized, 
for example, in transition metal oxides, a description based on “almost localized” electron 
states will be used to represent the Hamiltonian (2.17). 







Tight-binding systems 

Let us consider a “rarefied” lattice in which the constituent ion cores are separated by a 
distance in excess of the typical Bohr radius of the valence band electrons. In this “atomic 
limit” the weight of the electron wavefunctions is “tightly bound” to the lattice centers. 
Here, to formulate a microscopic theory of interactions, it is convenient to expand the 
Hamiltonian in a local basis that reflects the atomic orbital states of the isolated ion. Such 
a representation is presented by the basis of Wannier states defined by 

$$|\psi_{\mathbf R n}\rangle \equiv \frac{1}{\sqrt N} \sum_{\mathbf k}^{\rm B.Z.} e^{-i\mathbf k\cdot\mathbf R}\,|\psi_{\mathbf k n}\rangle, \qquad |\psi_{\mathbf k n}\rangle = \frac{1}{\sqrt N} \sum_{\mathbf R} e^{i\mathbf k\cdot\mathbf R}\,|\psi_{\mathbf R n}\rangle, \qquad (2.22)$$

where $\mathbf R$ denote the coordinates of the lattice centers, and $\sum_{\mathbf k}^{\rm B.Z.}$ represents a summation
over all momenta $\mathbf k$ in the first Brillouin zone. For a system with a vanishingly weak
interatomic overlap, i.e. a lattice where $V$ approaches a superposition of independent atomic
potentials, the Wannier function $\psi_{\mathbf R n}(\mathbf r) \equiv \langle \mathbf r|\psi_{\mathbf R n}\rangle$ converges on the $n$th orbital of an
isolated atom centered at coordinate $\mathbf R$. However, when the interatomic coupling is non-zero,
i.e. in a “real” solid, the $N$ formerly degenerate states labeled by $n$ split to form an energy
band (see figure below). Here, the Wannier functions (which, note, are not eigenfunctions
of the Hamiltonian) differ from those of the atomic orbitals through residual oscillations in
the region of overlap to ensure orthogonality of the basis. Significantly, in cases where the
Fermi energy lies between two energetically separated bands, the system presents insulating
behavior. Conversely, when the Fermi energy is located within a band, one may expect
metallic behavior. Ignoring the complications that arise when bands begin to overlap, we
will henceforth focus on metallic systems where the Fermi energy is located within a definite
band $n_0$.

How can the Wannier basis be exploited to obtain a simplified representation of the
general Hamiltonian (2.17)? The first thing to notice is that the Wannier states $\{|\psi_{\mathbf R n}\rangle\}$
form an orthonormal basis of the single-particle Hilbert space, i.e. the transformation
between the real space and the Wannier representation is unitary,
$|\mathbf r\rangle = \sum_{\mathbf R} |\psi_{\mathbf R}\rangle\langle\psi_{\mathbf R}|\mathbf r\rangle = \sum_{\mathbf R} \psi^*_{\mathbf R}(\mathbf r)\,|\psi_{\mathbf R}\rangle$. (Here, since we are
interested only in contributions arising from the particular “metallic” band $n_0$ in which
the Fermi energy lies, we have dropped the remaining set of bands $n \neq n_0$ and, with them,
reference to the specific band index.) (Exercise: By focusing on just a single band $n_0$, in
what sense is the Wannier basis now complete?) As such, it induces a transformation

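$$a^\dagger_{\sigma}(\mathbf r) = \sum_{\mathbf R} \psi^*_{\mathbf R}(\mathbf r)\, a^\dagger_{\mathbf R\sigma} \equiv \sum_{i} \psi^*_{\mathbf R_i}(\mathbf r)\, a^\dagger_{i\sigma}, \qquad (2.23)$$

between the real space and the Wannier space operator basis, respectively. In the second
representation, following a convention commonly used in the literature, we have labeled
the lattice center coordinates $\mathbf R \equiv \mathbf R_i$ by a counting index $i = 1, \ldots, N$. Similarly, the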








unitary transformation between Bloch and Wannier states Eq. (2.22) induces an operator 
transformation 



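$$a^\dagger_{\mathbf k\sigma} = \frac{1}{\sqrt N} \sum_{i} e^{i\mathbf k\cdot\mathbf R_i}\, a^\dagger_{i\sigma}, \qquad a^\dagger_{i\sigma} = \frac{1}{\sqrt N} \sum_{\mathbf k}^{\rm B.Z.} e^{-i\mathbf k\cdot\mathbf R_i}\, a^\dagger_{\mathbf k\sigma}. \qquad (2.24)$$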



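We can now use the transformation formulae (2.23) and (2.24) to formulate a Wannier
representation of the Hamiltonian (2.17). Using the fact that the Bloch states diagonalize
the single-particle component $\hat H_0$, we obtain

$$\hat H_0 = \sum_{\mathbf k} \epsilon_{\mathbf k}\, a^\dagger_{\mathbf k\sigma}\, a_{\mathbf k\sigma} \overset{(2.24)}{=} \frac{1}{N} \sum_{ii'} \sum_{\mathbf k} e^{i\mathbf k\cdot(\mathbf R_i - \mathbf R_{i'})}\, \epsilon_{\mathbf k}\, a^\dagger_{i\sigma}\, a_{i'\sigma} \equiv \sum_{ii'} a^\dagger_{i\sigma}\, t_{ii'}\, a_{i'\sigma},$$

where we have defined $t_{ii'} = \frac{1}{N}\sum_{\mathbf k} e^{i\mathbf k\cdot(\mathbf R_i - \mathbf R_{i'})}\, \epsilon_{\mathbf k}$. The new representation of $\hat H_0$ describes
electrons hopping from one lattice center $i'$ to another, $i$. The strength of the hopping matrix
element $t_{ii'}$ is controlled by the effective overlap of neighboring atoms. In the extreme atomic
limit, where the levels $\epsilon_{\mathbf k} = \text{const.}$ are degenerate, $t_{ii'} \propto \delta_{ii'}$ and no inter-atomic transport is
possible. The tight-binding representation becomes useful when $t_{i\neq i'}$ is non-vanishing, but
the orbital overlap is so weak that only nearest neighbor hopping effectively contributes.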



EXERCISE Taking a square lattice geometry, and setting $t_{ii'} = -t$ for $i, i'$ nearest neighbors
and zero otherwise, diagonalize the two-dimensional tight-binding Hamiltonian $\hat H_0$. Show that
the eigenvalues are given by $\epsilon_{\mathbf k} = -2t(\cos(k_x a) + \cos(k_y a))$. Sketch contours of constant energy
in the Brillouin zone and note the geometry at half-filling.
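
The result of this exercise can be cross-checked numerically. The sketch below, assuming a finite $L \times L$ lattice with periodic boundary conditions and lattice constant $a = 1$, builds the real-space hopping matrix $t_{ii'}$ and compares its spectrum with the cosine band; the names `hopping_matrix` and `analytic_levels` are illustrative.

```python
import numpy as np

def hopping_matrix(L, t=1.0):
    """Real-space matrix t_{ii'}: -t between nearest neighbors on a periodic L x L lattice."""
    N = L * L
    H = np.zeros((N, N))
    for x in range(L):
        for y in range(L):
            i = x * L + y
            for dx, dy in ((1, 0), (0, 1)):          # bonds to the right and upward
                j = ((x + dx) % L) * L + (y + dy) % L
                H[i, j] -= t
                H[j, i] -= t
    return H

def analytic_levels(L, t=1.0, a=1.0):
    """Sorted values of -2t(cos(kx a) + cos(ky a)) on the allowed momentum grid."""
    k = 2 * np.pi * np.arange(L) / (L * a)
    kx, ky = np.meshgrid(k, k, indexing="ij")
    return np.sort((-2 * t * (np.cos(kx * a) + np.cos(ky * a))).ravel())

L = 6
numeric = np.sort(np.linalg.eigvalsh(hopping_matrix(L)))
exact = analytic_levels(L)
print(np.max(np.abs(numeric - exact)))
```

At half-filling the constant-energy contour $\epsilon_{\mathbf k} = 0$ corresponds to $\cos(k_x a) + \cos(k_y a) = 0$, i.e. the square $|k_x| + |k_y| = \pi/a$ inside the Brillouin zone.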



To assess the potency of the tight-binding approximation, let us consider its application to 
two prominent realizations of carbon-based lattice systems, graphene and carbon nanotubes. 

INFO Graphene is a single layer of graphite: a planar hexagonal lattice of $sp^2$-hybridized⁷
carbon atoms connected by strong covalent bonds of their three planar $\sigma$-orbitals. (See
Fig. 2.2 and the figure overleaf for a schematic.) The remaining $p_z$ orbitals - oriented vertically to
the lattice plane - overlap weakly to form a band of mobile electrons. For a long time, it was
thought that graphene sheets in isolation will inevitably be destabilized by thermal fluctuations;
only layered stacks of graphene would mutually stabilize to form a stable compound - graphite.
It thus came as a surprise when in 2004 a team of researchers⁸ succeeded in the isolation of
large (micron-sized) graphene flakes on an SiO$_2$ substrate. (Meanwhile, the isolation of even free
standing graphene layers has become possible. In fact, our whole conception of the stability of
the compound has changed. It is now believed that whenever you draw a pencil line, a trail of
graphene flakes will be left behind.)

Immediately after its discovery, it became clear that graphene possesses unconventional
conduction properties. Nominally a gapless semiconductor, it has an electron mobility
$\sim 2 \times 10^5\ \mathrm{cm^2/Vs}$, by far more than that of even the purest silicon based semiconductors;
it shows manifestations of the integer quantum Hall effect qualitatively different from those of conventional
two-dimensional electron compounds (cf. Chapter 9 for a general discussion of the quantum Hall



7 Although this will not be necessary to follow the text, readers may find it rewarding to recapitulate the quantum
chemistry of $sp^2$-hybridized carbon and its covalent bonds.

8 K. S. Novoselov, et al., Electric field effect in atomically thin carbon films, Science 306 (2004), 666-9.








Figure 2.2 Left: optical microscopy image of graphene flakes. Regions labeled by ‘I’ define
monolayer graphene sheets. Right: STM image of the graphene samples shown in the left part. Images
taken from E. Stolyarova et al., High-resolution scanning tunneling microscopy imaging of
mesoscopic graphene sheets on an insulating surface, PNAS 104 (2007), 9210-12.



effect), etc. Although an in-depth discussion of graphene is beyond the scope of this text, we note 
that it owes most of its fascinating properties to its band structure: electrons in graphene show 
a linear dispersion and behave like two-dimensional Dirac fermions! By way of an illustration of 
the concepts discussed above, we here derive this unconventional band dispersion from a tight 
binding formulation of the system. 



To a first approximation, graphene’s $\pi$-electron system can be modeled as a tight-binding
Hamiltonian characterized by a single hopping matrix element between neighboring atoms $-t$,
and the energy offset $\epsilon$ of the $p$-electron states. To determine the spectrum of the system,
one may introduce a system of bi-atomic unit cells (see the ovals in the schematic figure
right) and two (non-orthogonal) unit vectors of the hexagonal lattice
$\mathbf a_1 = (\sqrt 3, 1)a/2$ and $\mathbf a_2 = (\sqrt 3, -1)a/2$, where
$a = |\mathbf a_1| = |\mathbf a_2| \simeq 2.46\ \text{Å}$ denotes the lattice constant. The tight-binding Hamiltonian can then
be represented in the form $\hat H = -\sum_{\langle \mathbf r, \mathbf r'\rangle} \left(t\, a^\dagger_1(\mathbf r)\, a_2(\mathbf r') + \text{h.c.}\right)$, where the sum runs over all
nearest neighbor basis vectors of the unit cells (the coordinate vectors of the bottom left
atom) and $a^\dagger_{1(2)}(\mathbf r)$ creates a state in the first (second) atom of the cell at vector $\mathbf r$. Switching
to a Fourier representation, the Hamiltonian assumes the form




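$$\hat H = \sum_{\mathbf k\sigma} \begin{pmatrix} a^\dagger_{1\sigma,\mathbf k} & a^\dagger_{2\sigma,\mathbf k} \end{pmatrix} \begin{pmatrix} 0 & -t f(\mathbf k) \\ -t f^*(\mathbf k) & 0 \end{pmatrix} \begin{pmatrix} a_{1\sigma,\mathbf k} \\ a_{2\sigma,\mathbf k} \end{pmatrix}, \qquad (2.25)$$

where $f(\mathbf k) = 1 + e^{-i k_1 a} + e^{i(-k_1 + k_2)a}$.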










Figure 2.3 (a) Spectrum of the tight-binding Hamiltonian (2.25) showing the point-like structure
of the Fermi surface when $E_F = 0$. (b) A contour plot of the same.

EXERCISE (Recall the concept of the reciprocal lattice in solid state theory.) To derive the
Fourier representation above, show that a system of two reciprocal lattice vectors conjugate to the
unit vectors above is given by $\mathbf G_{1/2} = \frac{2\pi}{\sqrt 3 a}(1, \pm\sqrt 3)$. Next, show that the Fourier decomposition
of a field operator reads as $a_{a}(\mathbf r) = \frac{1}{\sqrt N}\sum_{\mathbf k} e^{-i\frac{a}{2\pi}(k_1 \mathbf G_1 + k_2 \mathbf G_2)\cdot\mathbf r}\, a_{a,\mathbf k}$, where $k_i \in [0, 2\pi/a]$ is
quantized in units $2\pi/L_i$. ($L_i$ is the extension of the system in the direction of $\mathbf a_i$ and $N$ its
total number of unit cells.) Substitute this decomposition into the real space representation of
the Hamiltonian to arrive at the Fourier representation.

Diagonalizing the Hamiltonian, one obtains the dispersion relation, 

$$\epsilon_{\mathbf k} = \pm t\,|f(\mathbf k)| = \pm t\sqrt{3 + 2\cos(k_1 a) + 2\cos((k_1 - k_2)a) + 2\cos(k_2 a)}. \qquad (2.26)$$

Here, in contrast to the square lattice tight-binding Hamiltonian, the half-filled system is 
characterized by a point-like Fermi surface (see Figure 2.3). When lightly doped away from 
half-filling, the spectrum divides into Dirac-like spectra with a linear dispersion. Fig. 2.3 
shows the full spectrum of the tight binding Hamiltonian. Notice that of the six Dirac 
points displayed in the figure only two are independent. The complementary four can be 
reached from those two points by addition of a reciprocal lattice vector and, therefore, do 
not represent independent states. 
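
The dispersion and the location of the Dirac points can be checked with a few lines of code. The following is a minimal sketch that assumes $t = 1$ and works with the dimensionless combinations $k_1 a$ and $k_2 a$; the function names and the two candidate zero-energy points are our own illustrative choices, obtained by requiring the three phases in $f(\mathbf k)$ to sum to zero.

```python
import numpy as np

def f(k1a, k2a):
    """Structure factor f(k) = 1 + exp(-i k1 a) + exp(i(-k1 + k2) a)."""
    return 1.0 + np.exp(-1j * k1a) + np.exp(1j * (-k1a + k2a))

def bloch_matrix(k1a, k2a, t=1.0):
    """2 x 2 matrix appearing in Eq. (2.25)."""
    fk = f(k1a, k2a)
    return np.array([[0.0, -t * fk], [-t * np.conj(fk), 0.0]])

def dispersion(k1a, k2a, t=1.0):
    """Positive branch of Eq. (2.26)."""
    arg = 3 + 2 * np.cos(k1a) + 2 * np.cos(k1a - k2a) + 2 * np.cos(k2a)
    return t * np.sqrt(np.maximum(arg, 0.0))   # guard against tiny negative round-off

# Compare direct diagonalization with the closed form at random momenta
rng = np.random.default_rng(0)
pts = rng.uniform(0.0, 2 * np.pi, size=(100, 2))
max_dev = max(
    abs(np.linalg.eigvalsh(bloch_matrix(k1, k2))[1] - dispersion(k1, k2))
    for k1, k2 in pts
)

# Two inequivalent zero-energy points (k1 a, k2 a)
dirac_points = [(2 * np.pi / 3, 4 * np.pi / 3), (4 * np.pi / 3, 2 * np.pi / 3)]
gaps = [abs(f(k1, k2)) for k1, k2 in dirac_points]

# Linear dispersion: doubling a small momentum offset doubles the energy
k1, k2 = dirac_points[0]
dq = 1e-3
ratio = dispersion(k1 + 2 * dq, k2) / dispersion(k1 + dq, k2)
print(max_dev, gaps, ratio)
```

The vanishing of $|f|$ at the two listed points and a ratio close to 2 are consistent with the point-like Fermi surface and the locally linear, Dirac-like spectrum described above.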

EXERCISE Derive an explicit representation of the Dirac-Hamiltonian describing the low-energy
physics of the system. To this end, choose two inequivalent (not connected by reciprocal
lattice vectors) zero-energy points $\mathbf k_{1,2}$ in the Brillouin zone. Expand the Hamiltonian (2.25)
around these two points in small momentum deviations $\mathbf q = \mathbf k - \mathbf k_{1,2}$ up to linear order. Show
that in this approximation, $\hat H$ reduces to the sum of two two-dimensional Dirac Hamiltonians.

Now, against this background, let us consider the carbon nanotube system. A single-wall
nanotube describes a one-dimensional structure involving a graphene sheet rolled into a
cylinder (see Fig. 2.4 for an STM image of a carbon nanotube). Tubes of comparatively
simple structure are obtained by rolling the hexagonal pattern of the sheet along one of its
two axes of symmetry: along the $\mathbf a_1$-direction one obtains a zig-zag tube and along the



P. R. Wallace, The band theory of graphite, Phys. Rev. 71 (1947), 622-34. 








Figure 2.4 STM image of a carbon nanotube contacted with Pt electrodes. (Source: Courtesy of 
C. Dekker.) 

$(\mathbf a_1 + \mathbf a_2)$-direction an armchair tube (Exercise: Where do you see an armchair?). More
complex, “chiral” structures involve twists along the circumference of the tube.

By knowing the band structure of graphene, 
the dispersion of the nanotube system can 
be straightforwardly inferred. The situation is 
most easily visualized for the achiral geome- 
tries. Essentially a graphene sheet with peri- 
odic boundary conditions and length L in 
the direction transverse to the length of the 
tube, the quasi-one-dimensional system has 
a spectrum that obtains by projection of the 
two-dimensional dispersion onto lines indexed by the discrete values $k_\perp$ along the compact
axis. For example, the dispersion of the lowest transverse harmonic, $k_\perp = 0$, of the armchair
tube is given by (see the figure)

$$\epsilon_{k_\parallel} = \pm t\sqrt{3 + 4\cos(k_\parallel a/2) + 2\cos(k_\parallel a)}.$$

Notice the presence of two nodal points in the spectrum. Electrons with longitudinal momen- 
tum close to one of these two “hot spots” propagate with approximately linear dispersion. 
The physics of effectively one-dimensional electron systems of this type will be discussed in 
Section 2.2 below. 
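
The position of the two nodal points can be made explicit with a short calculation (our own sketch, using only the armchair dispersion quoted above). Writing $c \equiv \cos(k_\parallel a/2)$ and using $\cos(k_\parallel a) = 2c^2 - 1$,

$$3 + 4\cos(k_\parallel a/2) + 2\cos(k_\parallel a) = 4c^2 + 4c + 1 = (1 + 2c)^2,$$

so that

$$\epsilon_{k_\parallel} = \pm t\,\left|1 + 2\cos(k_\parallel a/2)\right|.$$

The spectrum therefore vanishes where $\cos(k_\parallel a/2) = -1/2$, i.e. at $k_\parallel a = \pm 4\pi/3$, and it is linear in the deviation of $k_\parallel$ from these points.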

EXERCISE Verify the one-dimensional dispersion relation above. To this end, notice that
the single-electron wave functions of the system must obey periodic boundary conditions
$\psi(\mathbf r + N(\mathbf a_1 + \mathbf a_2)) = \psi(\mathbf r)$, where $N$ is the number of cells in the transverse direction (i.e.
$L_\perp = N|\mathbf a_1 + \mathbf a_2| = Na\sqrt 3$ is the circumference of the tube). Use this condition to obtain the quantization rule
$k_1 + k_2 = 2\pi\sqrt 3\, m/L_\perp$, where $m$ is integer. When evaluated for the lowest harmonic, $m = 0$,
the dispersion relation (2.26) collapses to the form given above (where $k_\parallel = k_1 - k_2$ is the momentum in the
longitudinal direction). Show that the zig-zag tube supports zero-energy excitations only for
specific values of the transverse lattice spacings $N$. In general, it is insulating with a band gap
of ca. 0.5 eV. In the case of the chiral tubes, it may be shown that a third of the tubes are
semi-metallic while all others are insulating.¹⁰

10 For further details on the electronic structure of carbon nanotubes, we refer to the text Physical Properties of
Carbon Nanotubes by R. Saito, G. Dresselhaus, and M. S. Dresselhaus (Imperial College Press, 1998).






Interaction effects in the tight-binding system 






Although the pseudopotential of the nearly free electron system accommodates the effects 
of Coulomb interaction between the conduction and valence band electrons, the mutual 
Coulomb interaction between the conduction electrons themselves may lead to new physical 
phenomena. These effects can substantially alter the material parameters (e.g. effective 
masses), however, they neither change the nature of the ground state, nor that of the 
elementary quasi-particle excitations in any fundamental way - this is the content of Fermi- 
liquid theory, and a matter to which we will return. By contrast even weak interaction 
effects significantly influence the physics of the tight-binding system. Here, commensurability
effects combined with interaction may drive the system towards a correlated magnetic state 
or an insulating phase. 

To understand why, let us re-express the interaction in field operators associated with 
the Wannier states. Once again, to keep our discussion simple (yet generic in scope), let 
us focus attention on a single sub-band and drop reference to the band index. Then, 
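applied to the Coulomb interaction, the transformation (2.23) leads to the expansion
$\hat V_{ee} = \sum_{ii'jj'} U_{ii'jj'}\, a^\dagger_{i\sigma}\, a^\dagger_{i'\sigma'}\, a_{j'\sigma'}\, a_{j\sigma}$, where

$$U_{ii'jj'} = \frac12 \int d^d r \int d^d r'\, \psi^*_{\mathbf R_i}(\mathbf r)\, \psi_{\mathbf R_j}(\mathbf r)\, V(\mathbf r - \mathbf r')\, \psi^*_{\mathbf R_{i'}}(\mathbf r')\, \psi_{\mathbf R_{j'}}(\mathbf r'). \qquad (2.27)$$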

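Taken together, the combination of the contributions $\hat H_0$ and $\hat V_{ee}$,

$$\hat H = \sum_{ii'} a^\dagger_{i\sigma}\, t_{ii'}\, a_{i'\sigma} + \sum_{ii'jj'} U_{ii'jj'}\, a^\dagger_{i\sigma}\, a^\dagger_{i'\sigma'}\, a_{j'\sigma'}\, a_{j\sigma}, \qquad (2.28)$$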



where the sum of repeated spin indices is implied, defines the tight-binding representa- 
tion of the interaction Hamiltonian. Apart from the neglect of the neighboring sub-bands, 
the Hamiltonian is exact. Yet, to assimilate the effects of the interaction, it is useful to 
assess the relative importance of the different matrix elements drawing on the nature of the 
atomic limit that justified the tight-binding description. We will thus focus on contributions 
to $U_{ii'jj'}$ where the indices are either equal or, at most, nearest neighbors. Focusing on the
most relevant of these matrix elements, a number of physically different contributions can 
be identified: 

> The direct terms $U_{ii'ii'} \equiv V_{ii'}$ involve integrals over square moduli of Wannier functions
and couple density fluctuations at neighboring sites, $\sum_{i\neq i'} V_{ii'}\, \hat n_i\, \hat n_{i'}$, where
$\hat n_i = \sum_\sigma a^\dagger_{i\sigma} a_{i\sigma}$. This contribution accounts for the - essentially classical - interaction between
charges localized at neighboring sites (see Fig. 2.5). In certain materials, interactions of 
this type have the capacity to induce global instabilities in the charge distribution known 
as charge density wave instabilities. 

> A second important contribution derives from the exchange coupling, which induces
magnetic correlations among the electron spins. Setting $J^F_{ij} \equiv U_{ijji}$, and making use of










Figure 2.5 Different types of interaction mechanism induced by the tight-binding interaction $\hat V_{ee}$.
The curves symbolically indicate wavefunction envelopes. (a) Direct Coulomb interaction between
neighboring sites. Taking account of the exchange interaction, parallel alignment of spins (b) is
preferred since it enforces anti-symmetry of the spatial wave function. By contrast, for anti-parallel
spin configurations (c) the wave function amplitude in the repulsion zone is enhanced. (d) Coulomb
interaction between electrons of opposite spin populating the same site.



Pauli matrix identities (see below), one obtains

$$\sum_{i\neq j} U_{ijji}\, a^\dagger_{i\sigma}\, a^\dagger_{j\sigma'}\, a_{i\sigma'}\, a_{j\sigma} = -2 \sum_{i\neq j} J^F_{ij} \left(\hat{\mathbf S}_i \cdot \hat{\mathbf S}_j + \frac14\, \hat n_i\, \hat n_j\right).$$

Such contributions tend to induce weak ferromagnetic coupling of neighboring spins
(i.e. $J^F > 0$). The fact that an effective magnetic coupling is born out of the electrostatic
interaction between quantum particles is easily understood. Consider two electrons
inhabiting neighboring sites. The Coulomb repulsion between the particles is minimized 
if the orbital two-particle wave function is anti-symmetric and, therefore, has low ampli- 
tude in the interaction zone between the particles. Since the overall wavefunction must 
be anti-symmetric, the energetically favored real-space configuration enforces a symmet- 
ric alignment of the two spins. Such a mechanism is familiar from atomic physics where 
it is manifested as Hund’s rule. In general, magnetic interactions in solids are usually 
generated as an indirect manifestation of the much stronger Coulomb interaction. 

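EXERCISE Making use of the Pauli matrix identity $\boldsymbol\sigma_{\alpha\beta} \cdot \boldsymbol\sigma_{\gamma\delta} = 2\delta_{\alpha\delta}\delta_{\beta\gamma} - \delta_{\alpha\beta}\delta_{\gamma\delta}$, show that
$\hat{\mathbf S}_i \cdot \hat{\mathbf S}_j = -\frac12\, a^\dagger_{i\alpha}\, a^\dagger_{j\beta}\, a_{i\beta}\, a_{j\alpha} - \frac14\, \hat n_i\, \hat n_j$ where, as usual, $\hat{\mathbf S}_i = \frac12\, a^\dagger_{i\alpha}\, \boldsymbol\sigma_{\alpha\beta}\, a_{i\beta}$ denotes the operator
for spin 1/2, and the lattice sites $i$ and $j$ are assumed distinct.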
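
Both identities in this exercise can be confirmed by brute force on the 16-dimensional Fock space of two sites. The sketch below is an illustration under stated assumptions: it uses a Jordan-Wigner matrix representation of the four fermion modes (our choice, not part of the text), and the helper names `jordan_wigner`, `dag`, `spin`, and `density` are hypothetical.

```python
import numpy as np
from functools import reduce

def jordan_wigner(M):
    """Annihilation matrices for M fermion modes on the 2^M-dimensional Fock space."""
    I2 = np.eye(2)
    Z = np.diag([1.0, -1.0])
    lower = np.array([[0.0, 1.0], [0.0, 0.0]])
    return [reduce(np.kron, [Z] * j + [lower] + [I2] * (M - j - 1)) for j in range(M)]

def dag(op):
    return op.conj().T

# Pauli matrices sigma^x, sigma^y, sigma^z
sigma = np.array([[[0, 1], [1, 0]],
                  [[0, -1j], [1j, 0]],
                  [[1, 0], [0, -1]]], dtype=complex)

# Pauli identity: sum_x sigma^x_{ab} sigma^x_{cd} = 2 d_{ad} d_{bc} - d_{ab} d_{cd}
delta = np.eye(2)
pauli_lhs = np.einsum("xab,xcd->abcd", sigma, sigma)
pauli_rhs = 2 * np.einsum("ad,bc->abcd", delta, delta) - np.einsum("ab,cd->abcd", delta, delta)

# Modes ordered as (i,up), (i,down), (j,up), (j,down)
a = jordan_wigner(4)
site_i, site_j = a[0:2], a[2:4]

def spin(site):
    """Components of S = (1/2) a^dag_alpha sigma_{alpha beta} a_beta on one site."""
    return [0.5 * sum(sigma[x][al, be] * dag(site[al]) @ site[be]
                      for al in range(2) for be in range(2)) for x in range(3)]

def density(site):
    return sum(dag(op) @ op for op in site)

S_i, S_j = spin(site_i), spin(site_j)
lhs = sum(S_i[x] @ S_j[x] for x in range(3))
exchange = sum(dag(site_i[al]) @ dag(site_j[be]) @ site_i[be] @ site_j[al]
               for al in range(2) for be in range(2))
rhs = -0.5 * exchange - 0.25 * density(site_i) @ density(site_j)
print(np.allclose(pauli_lhs, pauli_rhs), np.allclose(lhs, rhs))
```

The check relies on the two sites being distinct, as stated in the exercise, since the sign of the exchange term comes from anticommuting operators on different sites.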

> Finally, far into the atomic limit, where the atoms are very well separated, and the overlap
between neighboring orbitals is weak, the matrix elements $t_{ij}$ and $J^F_{ij}$ are exponentially
small in the interatomic separation. In this limit, the “on-site” Coulomb or Hubbard
interaction, $U_{iiii} \equiv U/2$, $\sum_{i\sigma\sigma'} U_{iiii}\, a^\dagger_{i\sigma}\, a^\dagger_{i\sigma'}\, a_{i\sigma'}\, a_{i\sigma} = \sum_i U\, \hat n_{i\uparrow}\, \hat n_{i\downarrow}$, generates the dominant
interaction mechanism. Taking only the nearest neighbor contribution to the hopping
matrix elements, and neglecting the energy offset due to the diagonal term, the effective 








Hamiltonian takes a simplified form known as the Hubbard model, 



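$$\hat H = -t\sum_{\langle ij\rangle} a^\dagger_{i\sigma} a_{j\sigma} + U\sum_i \hat n_{i\uparrow}\hat n_{i\downarrow}, \qquad (2.29)$$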



where $\langle ij\rangle$ is a shorthand used to denote neighboring lattice sites. In hindsight, a model
of this structure could have been proposed from the outset on purely phenomenological 
grounds: electrons tunnel between atomic orbitals localized on individual lattice sites 
while double occupancy of a lattice site incurs an energetic penalty associated with the 
mutual Coulomb interaction. 



Mott-Hubbard transition and the magnetic state 

Deceptive in its simplicity, the Hubbard model is acknowledged as a paradigm of strong 
electron correlation in condensed matter. Yet, after 40 years of intense investigation, the 
properties of this seemingly simple model system - the character of the ground state and 
nature of the quasi-particle excitations - are still the subject of controversy (at least in 
dimensions higher than one - see below). Nevertheless, given the importance attached to this 
system, we will close this section with a brief discussion of the remarkable phenomenology 
that is believed to characterize the Hubbard system. 

As well as dimensionality, the phase behavior of the Hubbard Hamiltonian is character- 
ized by three dimensionless parameters: the ratio of the Coulomb interaction scale to the 
bandwidth U /t, the particle density or filling fraction n (i.e. the average number of electrons 
per site), and the (dimensionless) temperature, T/t. The symmetry of the Hamiltonian 
under particle-hole interchange (exercise) allows one to limit consideration to densities in 
the range 0 < n < 1 while densities 1 < n < 2 can be inferred by “reflection.” 

Focusing first on the low temperature system, in the dilute limit $n \ll 1$, the typical electron
wavelength is greatly in excess of the site separation and the dynamics are free. Here the local
interaction presents only a weak perturbation and one can expect the properties of the Hubbard
system to mirror those of the weakly interacting nearly free electron system. While the interaction
remains weak one expects a metallic behavior to persist.

By contrast, let us consider the half-filled system where the average site occupancy is
unity. Here, if the interaction is weak, $U/t \ll 1$, one may again expect properties reminiscent



Sir Nevill Francis Mott 1905-96

Nobel Laureate in Physics in 
1977, with Philip W. Anderson 
and John H. van Vleck, for their 
"fundamental theoretical investi- 
gations of the electronic structure 
of magnetic and disordered sys- 
tems.” Amongst his contributions to science, Mott 
provided a theoretical basis to understand the transi- 
tion of materials from metallic to nonmetallic states 
(the Mott transition). (Image © The Nobel Foun- 
dation.) 









of the weakly interacting electron system. 11 If, on the other hand, the interaction is very
strong, $U/t \gg 1$, site double occupancy is inhibited and electrons in the half-filled system
become “jammed”: migration of an electron to a neighboring lattice site necessitates site 
double occupancy incurring an energy cost U. Here, in this strongly correlated state, the 
mutual Coulomb interaction between the electrons drives the system from a metallic to an 
insulating phase with properties very different from those of a conventional band insulator. 

INFO Despite the ubiquity of the experimental phenomenon (first predicted in a celebrated
work by Mott) the nature of the Mott-Hubbard transition from the metallic to the insulating
phase in the half-filled system has been the subject of considerable discussion and debate.
In the original formulation, following a suggestion of Peierls, Mott conceived of an insulator
characterized by two “Hubbard bands” with a bandwidth $\sim t$ separated by a charge gap $U$. 12
States of the upper band engage site double occupancy while those states that make up the lower 
band do not. The transition between the metallic and the insulating phase was predicted to 
occur when the interaction was sufficiently strong that a charge gap develops between the bands. 
Later, starting from the weakly interacting Fermi-liquid, Brinkman and Rice 13 proposed that 
the transition was associated with the localization of quasi-particles created by an interaction 
driven renormalization of the effective mass. Finally, a third school considers the transition to 
the Mott insulating phase as inexorably linked to the development of magnetic correlations in 
the weak coupling system - the Slater instability. 

In fact, the characterizations of the transition above are not mutually exclusive. Indeed, in the 
experimental system, one finds that all three possibilities are, in a sense, realized. In particular, 
a transition between the Mott insulating phase and an itinerant electron phase can be realized 
in two ways. In the first case, one can reduce the interaction strength U/t while, in the second, 
one can introduce charge carriers (or vacancies) into the half-filled system. Experimentally, the 
characteristic strength of the interaction is usually tuned by changing the bandwidth t through 
external pressure (see Fig. 2.6), while a system may be doped away from half-filling by chemical 
substitution. Remarkably, by focusing on the scaling behavior close to the critical end-point, 
researchers have been able to show that the Mott transition in this system belongs to the uni- 
versality class of the three-dimensional Ising model (see the discussion of critical phenomena in 
Chapter 8). Lowering the temperature, both the Mott insulating phase and the strongly corre- 
lated metallic phase exhibit a transition to a magnetic phase where the local moments order 
antiferromagnetically (for the explanation of this phenomenon, see below). 

Experimentally, it is often found that the low-temperature phase of the Mott insulator is 
accompanied by the anti-ferromagnetic ordering of the local moments. The origin of 
these magnetic correlations can be traced to a mechanism known as superexchange and 
can be understood straightforwardly within the framework of the Hubbard model system. 



11 In fact, one has to exercise some caution since the commensurability of the Fermi wavelength with the lattice
can initiate a transition to an insulating spin density wave state characterized by a small quasi-particle energy
gap. In later chapters, we will discuss the nature of this Slater instability (J. C. Slater, Magnetic effects and
the Hartree-Fock equation, Phys. Rev. 82 (1951), 538-41) within the framework of the quantum field integral.

12 N. F. Mott, The basis of the electron theory of metals with special reference to the transition metals, Proc.
Roy. Soc. A 62 (1949), 416-22 - for a review see, e.g., N. F. Mott, Metal-insulator transition, Rev. Mod. Phys.
40 (1968), 677-83, or N. F. Mott, Metal-Insulator Transitions, 2nd ed. (Taylor and Francis, 1990).

13 W. Brinkman and T. M. Rice, Application of Gutzwiller’s variational method to the metal-insulator transition,
Phys. Rev. B 2 (1970), 4302-4.







Figure 2.6 Conductivity of Cr-doped V$_2$O$_3$ as a function of decreasing pressure and temperature.
At temperatures below the Mott-Hubbard transition point ($P_c = 3738$ bar, $T_c = 457.5$ K) the
conductivity reveals hysteretic behavior characteristic of a first-order transition. (Reprinted from
P. Limelette, A. Georges, D. Jerome, et al., Universality and critical behavior at the Mott transition,
Science 302 (2003), 89-92. Copyright 2003 AAAS.)



To this end, one may consider a simple “two-site” system from which the characteristics 
of the infinite lattice system can be inferred. For the two-site system, at half-filling (i.e. 
with just two electrons to share between the two sites), one can identify a total of six 
basis states: two spin-polarized states, $a^\dagger_{1\uparrow}a^\dagger_{2\uparrow}|\Omega\rangle$ and $a^\dagger_{1\downarrow}a^\dagger_{2\downarrow}|\Omega\rangle$, and four states with $S^z_{\rm total} = 0$:
$|s_1\rangle = a^\dagger_{1\uparrow}a^\dagger_{2\downarrow}|\Omega\rangle$, $|s_2\rangle = a^\dagger_{2\uparrow}a^\dagger_{1\downarrow}|\Omega\rangle$, $|d_1\rangle = a^\dagger_{1\uparrow}a^\dagger_{1\downarrow}|\Omega\rangle$, and $|d_2\rangle = a^\dagger_{2\uparrow}a^\dagger_{2\downarrow}|\Omega\rangle$. Recalling
the constraints imposed by the Pauli principle, it is evident that the fully spin polarized
states are eigenstates of the Hubbard Hamiltonian with zero energy, while the remaining
eigenstates involve superpositions of the basis states $|s_i\rangle$ and $|d_i\rangle$. In the strong coupling
limit $U/t \gg 1$, the ground state will be composed predominantly of states with no double
occupancy, $|s_i\rangle$. To determine the precise structure of the ground state, we could simply
diagonalize the $4\times 4$ Hamiltonian exactly - a procedure evidently infeasible in the large
lattice system. Instead, to gain some intuition for the extended system, we will effect a
perturbation theory which projects the insulating system onto a low-energy effective spin
Hamiltonian. Specifically, we will treat the hopping part of the Hamiltonian $\hat H_t$ as a weak
perturbation of the Hubbard interaction $\hat H_U$.

To implement the perturbation theory, it is helpful to invoke a canonical transformation 
of the Hamiltonian, namely 

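$$\hat H \mapsto \hat H' \equiv e^{-t\hat O}\hat H e^{t\hat O} = e^{-t[\hat O,\;\cdot\;]}\hat H \equiv \hat H - t[\hat O,\hat H] + \frac{t^2}{2!}[\hat O,[\hat O,\hat H]] + \cdots, \qquad (2.30)$$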







where the exponentiated commutator is defined by the series expansion on the right. 

EXERCISE Prove the second equality. (Hint: Consider the derivative of H' with respect to t.) 

By choosing the operator $\hat O$ such that $\hat H_t + t[\hat H_U, \hat O] = 0$, all terms at first order in $t$ can
be eliminated from the transformed Hamiltonian. As a result, the effective Hamiltonian is
brought to the form

$$\hat H' = \hat H_U + \frac{t}{2}[\hat H_t, \hat O] + O(t^3). \qquad (2.31)$$

Applying the ansatz, $t\hat O = [\hat P_s\hat H_t\hat P_d - \hat P_d\hat H_t\hat P_s]/U$, where $\hat P_s$ and $\hat P_d$ are operators that project
onto the singly and doubly occupied subspaces respectively, the first-order cancellation is
assured.

EXERCISE To verify this statement, take the matrix elements of the first-order equation with
the basis states. Alternatively, it can be confirmed by inspection, noting that $\hat P_s\hat P_d = 0$, $\hat H_U\hat P_s = 0$,
and, in the present case, $\hat P_s\hat H_t\hat P_s = \hat P_d\hat H_t\hat P_d = 0$.

Substituting this result into Eq. (2.31) and projecting onto the singly occupied subspace 
one obtains 

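$$\hat P_s\hat H'\hat P_s = -\frac{1}{U}\hat P_s\hat H_t\hat P_d\hat H_t\hat P_s = -\frac{2t^2}{U}\hat P_s\left(1 + a^\dagger_{1\sigma}a^\dagger_{2\sigma'}a_{1\sigma'}a_{2\sigma}\right)\hat P_s = J\left(\hat S_1\cdot\hat S_2 - \frac{1}{4}\right),$$

where $J = 4t^2/U$ denotes the strength of the anti-ferromagnetic exchange interaction
that couples the spins on neighboring sites.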
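
The steps of the canonical transformation can be checked with explicit $4\times 4$ matrices for the two-site problem. The sketch below is an illustration under stated assumptions rather than the authors' own calculation: it orders the $S^z_{\rm total}=0$ basis as $(|s_1\rangle, |s_2\rangle, |d_1\rangle, |d_2\rangle)$ and assumes the sign convention $\hat H_t|s_i\rangle = -t(|d_1\rangle + |d_2\rangle)$; all function names are hypothetical.

```python
def matmul(A, B):
    """Product of two square matrices stored as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def axpy(A, B, coeff=1.0):
    """Return A + coeff * B."""
    return [[a + coeff * b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]


def commutator(A, B):
    return axpy(matmul(A, B), matmul(B, A), -1.0)


def max_abs(A):
    return max(abs(x) for row in A for x in row)


def canonical_transformation(t=1.0, U=8.0):
    # Basis order (|s1>, |s2>, |d1>, |d2>); assumed convention: H_t|s_i> = -t(|d1> + |d2>).
    Ht = [[0, 0, -t, -t], [0, 0, -t, -t], [-t, -t, 0, 0], [-t, -t, 0, 0]]
    HU = [[0, 0, 0, 0], [0, 0, 0, 0], [0, 0, U, 0], [0, 0, 0, U]]
    Ps = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]
    Pd = [[0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
    zero = [[0.0] * 4 for _ in range(4)]
    # Ansatz tO = (Ps Ht Pd - Pd Ht Ps) / U.
    tO = axpy(matmul(matmul(Ps, Ht), Pd), matmul(matmul(Pd, Ht), Ps), -1.0)
    tO = axpy(zero, tO, 1.0 / U)
    first_order = axpy(Ht, commutator(HU, tO))           # H_t + t[H_U, O], should vanish
    second_order = axpy(zero, commutator(Ht, tO), 0.5)   # (t/2)[H_t, O]
    spin_block = [row[:2] for row in second_order[:2]]   # projection onto |s1>, |s2>
    return max_abs(first_order), spin_block


residual, block = canonical_transformation(t=1.0, U=8.0)
print(residual, block)
```

In this basis the operator $J(\hat S_1\cdot\hat S_2 - 1/4)$ is represented by $-(J/2)$ times the $2\times 2$ matrix of ones (energy $-J$ for the singlet $|s_1\rangle + |s_2\rangle$, zero for the triplet $|s_1\rangle - |s_2\rangle$), so every entry of the projected block should equal $-2t^2/U$.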

EXERCISE Remembering the anti-commutation relations of the electron operators, find the
matrix elements of the Hubbard Hamiltonian on the four basis states $|s_i\rangle$ and $|d_i\rangle$. Diagonalizing
the $4\times 4$ matrix Hamiltonian, obtain the eigenstates of the system. In the strong coupling system
$U/t \gg 1$, determine the spin and energy dependence of the ground state.
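
One way to approach such a calculation is to let a short program track the fermionic signs. The sketch below is a hedged illustration, not the book's solution: it stores states as dictionaries from occupation bitmasks to amplitudes, and the mode ordering $(1\uparrow, 1\downarrow, 2\uparrow, 2\downarrow)$, the function names, and the candidate eigenvector are all assumptions introduced here. It applies the two-site Hubbard Hamiltonian and checks that the spin singlet built from $|s_1\rangle + |s_2\rangle$ with a small admixture of $|d_1\rangle + |d_2\rangle$ is an eigenstate with energy $U/2 - (U^2/4 + 4t^2)^{1/2}$.

```python
import math

# Single-particle modes of the two-site problem (labels are illustrative).
MODES = {"1up": 0, "1dn": 1, "2up": 2, "2dn": 3}


def apply_op(mode, create, state):
    """Apply a^dagger_mode (create=True) or a_mode (create=False) to {bitmask: amplitude}."""
    out = {}
    for mask, amp in state.items():
        if ((mask >> mode) & 1) == int(create):
            continue  # creating on a filled level or annihilating an empty one gives zero
        # Fermionic sign: one factor of -1 per occupied mode with smaller index.
        sign = -1 if bin(mask & ((1 << mode) - 1)).count("1") % 2 else 1
        new_mask = mask ^ (1 << mode)
        out[new_mask] = out.get(new_mask, 0.0) + sign * amp
    return out


def combine(*terms):
    """Linear combination of states given as (coefficient, state) pairs."""
    out = {}
    for coeff, vec in terms:
        for mask, amp in vec.items():
            out[mask] = out.get(mask, 0.0) + coeff * amp
    return out


def hubbard(state, t, U):
    """Apply H = -t sum_s (a+_1s a_2s + h.c.) + U sum_i n_iup n_idn to a state."""
    terms = []
    for spin in ("up", "dn"):
        i, j = MODES["1" + spin], MODES["2" + spin]
        terms.append((-t, apply_op(i, True, apply_op(j, False, state))))
        terms.append((-t, apply_op(j, True, apply_op(i, False, state))))
    for site in ("1", "2"):
        up, dn = MODES[site + "up"], MODES[site + "dn"]
        doubly = {m: a for m, a in state.items() if (m >> up) & 1 and (m >> dn) & 1}
        terms.append((U, doubly))
    return combine(*terms)


def build(*modes):
    """Return a+_{m1} a+_{m2} ... |vacuum> (rightmost operator acts first)."""
    state = {0: 1.0}
    for m in reversed(modes):
        state = apply_op(m, True, state)
    return state


S1, S2 = build(0, 3), build(2, 1)   # |s1>, |s2>
D1, D2 = build(0, 1), build(2, 3)   # |d1>, |d2>


def norm_inf(state):
    return max((abs(a) for a in state.values()), default=0.0)


def singlet_ground_state(t, U):
    """Energy and residual |H psi - E psi| of the candidate singlet ground state."""
    energy = U / 2 - math.sqrt(U * U / 4 + 4 * t * t)
    y = -energy / (2 * t)   # weight of the doubly occupied states
    psi = combine((1.0, S1), (1.0, S2), (y, D1), (y, D2))
    residual = combine((1.0, hubbard(psi, t, U)), (-energy, psi))
    return energy, norm_inf(residual)


def triplet_residual(t, U):
    """|H psi| for the S_z = 0 triplet |s1> - |s2>, expected to vanish."""
    return norm_inf(hubbard(combine((1.0, S1), (-1.0, S2)), t, U))


energy, residual = singlet_ground_state(t=1.0, U=20.0)
print(energy, -4 * 1.0 ** 2 / 20.0, residual)
```

For $U/t \gg 1$ the printed energy approaches $-J = -4t^2/U$, in line with the effective spin Hamiltonian, while the triplet combination stays at zero energy.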

The perturbation theory above shows 
that electrons subject to a strong 
local repulsive Coulomb interac- 
tion have a tendency to adopt an 
antiparallel or antiferromagnetic spin 
configuration between neighboring 
sites. This observation has a simple 
physical interpretation. Anti-parallel 
spins can take advantage of the 
hybridization (however small) and 
reduce their kinetic energy by hop- 
ping to a neighboring site (see Fig. 2.7). Parallel spins on the other hand are restricted from 
participating in this virtual process by the Pauli principle. This mechanism, which involves 
a two-step process, was first formulated by Anderson 14 and is known as superexchange. 



Philip W. Anderson 1923- 

Nobel Laureate in Physics in 
1977, with Sir Nevill Mott and
John H. van Vleck, for their 
“fundamental theoretical investi- 
gations of the electronic structure 
of magnetic and disordered sys- 
tems.” Anderson made numerous contributions to 
theoretical physics from theories of localization and 
antiferromagnetism to superconductivity. (Image © 
The Nobel Foundation.) 




14 P. W. Anderson, Antiferromagnetism. Theory of superexchange interaction, Phys. Rev. 79 (1950), 350-6.





Figure 2.7 Top: hybridization of spin polarized states is forbidden by Pauli exclusion. Bottom: 
superexchange mechanism by which two antiparallel spins can lower their energy by a virtual 
process in which the upper Hubbard band is occupied. 





The calculation presented above is easily generalized to an extended lattice system. Once
again, projecting onto a basis in which all sites are singly occupied, virtual exchange processes
favor an antiferromagnetic arrangement of neighboring spins. Such a correlated magnetic
insulator is described by the quantum spin-1/2 Heisenberg Hamiltonian

$$\hat H = J\sum_{\langle mn\rangle}\hat S_m\cdot\hat S_n, \qquad (2.32)$$



where, as usual, $\langle mn\rangle$ denotes a sum of neighboring spins on the lattice and the positive
exchange constant $J \sim t^2/U$. While, in the insulating magnetic phase, the charge degrees
of freedom remain “quenched,” spin fluctuations can freely propagate. 

When doped away from half-filling, the behavior of the Hubbard model is notoriously 
difficult to resolve. The removal of electrons from the half-filled system introduces vacancies 
into the “lower Hubbard band” that may propagate through the lattice. For a low con- 
centration of holes, the strong coupling Hubbard system may be described by the effective 
t-J Hamiltonian, 



$$\hat H_{t\text{-}J} = -t\sum_{\langle mn\rangle}\hat P_s a^\dagger_{m\sigma}a_{n\sigma}\hat P_s + J\sum_{\langle mn\rangle}\hat S_m\cdot\hat S_n.$$



However, the passage of vacancies is frustrated by the antiferromagnetic spin correlations 
of the background. Here transport depends sensitively on the competition between the 
exchange energy of the spins and the kinetic energy of the holes. Oddly, at $J = 0$ (i.e.
$U = \infty$), the ground state spin configuration is known to be driven ferromagnetic by a
single hole while, for J > 0, it is generally accepted that a critical concentration of holes is 
required to destabilize antiferromagnetic order. 

EXERCISE At $U = \infty$, all $2^N$ states of the half-filled Hubbard model (with $N$ the number of sites) are degenerate - each
site is occupied by a single electron of arbitrary spin. This degeneracy is lifted by the removal 
of a single electron from the lower Hubbard band. In such a case, there is a theorem due to 












Nagaoka 15 that, on a bipartite lattice (i.e. one in which the neighbors of one sublattice belong to 
the other sublattice), the ground state is ferromagnetic. For a four-site “plaquette” with three 
electrons determine the eigenspectrum of the Hubbard system with $U = \infty$ within the manifold
(a) $S^z_{\rm total} = 3/2$, and (b) $S^z_{\rm total} = 1/2$. In each case, determine the total spin of the ground state.
(Hint: In (b) there are a total of 12 basis states - here it is useful to arrange these states in the
order in which they are generated by application of the Hamiltonian.)



INFO The rich behavior of the Mott-Hubbard system is nowhere more exemplified than in the 
ceramic cuprate system - the basic material class of the high-temperature superconductors. 
Cuprates are built of layers of CuO$_2$ separated by heavy rare earth ions such as lanthanum. Here,
the copper ions adopt a square lattice configuration separated by oxygen ions. At half-filling,
electrons in the outermost occupied shell of the copper sites in the plane adopt a partially filled
$3d^9$ configuration, while the oxygen sites are completely filled. Elevated in energy by a frozen
lattice distortion, the Fermi energy lies in the $d_{x^2-y^2}$ orbital of copper. According to a simple
band picture, the single band is exactly half-filled (one electron per Cu site) and, therefore,
should be metallic. However, strong electron interaction
drives the cuprate system into an insulating antiferromagnetic Mott-Hubbard phase.

When doped away from half-filling (by, for example, replacing the rare earth atoms by others 
with a different stoichiometry; see the figure, where the phase diagram of La$_{2-x}$Sr$_x$CuO$_4$ is shown
as a function of the concentration x of Sr atoms replacing La atoms and temperature), charge 
carriers are introduced into the “lower Hubbard band.” However, in this case, the collapse of the 
Hubbard gap and loss of antiferromagnetic (AF) order is accompanied by the development of a 
high-temperature unconventional superconducting (SC) phase whose mechanism is believed to be 
rooted in the exchange of antiferromagnetic spin fluctuations. Whether the rich phenomenology 
of the cuprate system is captured by the Hubbard model remains a subject of great interest and 
speculation. 







This concludes our preliminary survey of the rich phenomenology of the interacting electron system.
Notice that, so far, we have merely discussed ways to distill a reduced model from the original
microscopic many-body Hamiltonian (2.17). However, save for the two examples of free field theories
analyzed in Chapter 1, we have not yet learned how methods of second quantization can be applied to
actually solve problems. To this end, in the following section we will illustrate the application of the
method on a prominent strongly interacting problem.



15 Y. Nagaoka, Ferromagnetism in a narrow, almost half-filled s-band, Phys. Rev. 147 (1966), 392-405.





Interacting fermions in one dimension 






Within the context of many-body physics, a theory is termed free if the Hamiltonian is 
bilinear in creation and annihilation operators, i.e. $\hat H \sim a^\dagger_\mu H_{\mu\nu} a_\nu$, where $H$ may be
a finite- or infinite-dimensional matrix. 16 Such models are “solvable” in the sense that the
solution of the problem simply amounts to a diagonalization of the matrix (subject to
the preservation of the commutation relations of the operators $a$ and $a^\dagger$). However, only a
few models of interest belong to this category. In general, interaction contributions typically 
quartic in the field operators are present and complete analytical solutions are out of reach. 
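
The sense in which a free theory is "solved" by diagonalizing a matrix can be illustrated with a concrete case. The sketch below is an added example, not one taken from the text: it assumes spinless fermions hopping with amplitude $t$ on a ring of `n_sites` sites, for which plane waves diagonalize $H_{\mu\nu}$ with levels $\epsilon_k = -2t\cos(2\pi k/N)$, and the many-body ground state is obtained by filling the lowest levels. The function names are hypothetical.

```python
import cmath
import math


def ring_matrix(n_sites, t):
    """Matrix H_{mn} of nearest-neighbor hopping on a ring (n_sites >= 3 assumed)."""
    H = [[0.0] * n_sites for _ in range(n_sites)]
    for m in range(n_sites):
        H[m][(m + 1) % n_sites] += -t
        H[(m + 1) % n_sites][m] += -t
    return H


def plane_wave(n_sites, k):
    """Normalized plane wave with wavenumber 2*pi*k/n_sites."""
    return [cmath.exp(2j * math.pi * k * n / n_sites) / math.sqrt(n_sites)
            for n in range(n_sites)]


def max_residual(n_sites, t):
    """Largest component of H v_k - eps_k v_k over all plane waves v_k."""
    H = ring_matrix(n_sites, t)
    worst = 0.0
    for k in range(n_sites):
        v = plane_wave(n_sites, k)
        eps = -2 * t * math.cos(2 * math.pi * k / n_sites)
        Hv = [sum(H[m][n] * v[n] for n in range(n_sites)) for m in range(n_sites)]
        worst = max(worst, max(abs(Hv[m] - eps * v[m]) for m in range(n_sites)))
    return worst


def ground_state_energy(n_sites, n_particles, t):
    """Fill the n_particles lowest single-particle levels (Pauli principle)."""
    levels = sorted(-2 * t * math.cos(2 * math.pi * k / n_sites) for k in range(n_sites))
    return sum(levels[:n_particles])


print(max_residual(8, 1.0), ground_state_energy(8, 4, 1.0))
```

Once the single-particle levels are known, every many-body eigenstate follows by choosing which levels to occupy; no such shortcut exists when quartic interaction terms are present.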

Yet there are a few precious examples of genuinely interacting systems that are amenable 
to (nearly) exact solution. In this section we will address an important representative of 
this class, namely the one-dimensional interacting electron gas. Not only is the analysis of 
this system physically interesting but, in addition, it provides an opportunity to practice 
working with the second quantized operator formalism on a deeper level. 

Qualitative discussion 

Consider the nearly free electron Hamiltonian (2.18) and (2.19) reduced to a one-dimensional
environment. Absorbing the chemical potential $E_F$ into the definition of the Hamiltonian,
and neglecting spin degrees of freedom (e.g. one might consider a fully spin polarized band),

$$\hat H = \sum_k a^\dagger_k\left(\frac{k^2}{2m} - E_F\right)a_k + \frac{1}{2L}\sum_{kk',q\neq 0} V(q)\, a^\dagger_{k-q}a^\dagger_{k'+q}a_{k'}a_k. \qquad (2.33)$$

INFO At first sight, the treatment of a one-dimensional electron system may seem an aca- 
demic exercise. However, effective one-dimensional interacting fermion systems are realized in 
a surprisingly rich spectrum of materials. We have already met with carbon nanotubes 
above. A nanotube is surrounded by clouds of mobile electrons (see earlier discussion in sec- 
tion 2.2). With the latter, confinement of the circumferential direction divides the system into 
a series of one-dimensional bands, each classified by a sub-band index and a wavenumber k. 
At low temperatures, the Fermi surface typically intersects a single sub-band, allowing atten- 
tion to be drawn to a strictly one-dimensional system. A similar mechanism renders certain 
organic molecules (such as the Bechgaard salt (TMTSF)$_2$PF$_6$, where TMTSF stands for the
tetramethyl-tetraselenafulvalene) one-dimensional conductors.

A third, solid state, realization is presented by artificial low-dimensional structures fab- 
ricated from semiconducting devices. Redistribution of electron charge at the interface of a 
GaAs/AlGaAs heterostructure results in the formation of a two-dimensional electron gas. 
By applying external gates, it is possible to fabricate quasi-one-dimensional semiconductor 
quantum wires in which electron motion in the transverse direction is impeded by a large 
potential gradient (Fig. 2.8 (a)). At sufficiently low Fermi energies, only the lowest eigenstate 
of the transverse Schrödinger equation (the lowest “quantum mode”) is populated and one is
left with a strictly one-dimensional electron system. There are other realizations, such as edge
modes in quantum Hall systems, “stripe phases” in high-temperature superconductors, or
certain inorganic crystals, but we shall not discuss these here explicitly. 

16 More generally, a free Hamiltonian may also contain contributions $\sim a_\mu a_\nu$ and $a^\dagger_\mu a^\dagger_\nu$.





Figure 2.8 Different realizations of one-dimensional electron systems. (a) Steep potential well (realizable
in, e.g., gated two-dimensional electron systems). (b) (Approximately) cylindrical quantum
system (carbon nanotubes, quasi-one-dimensional molecules, etc.). In both cases, the single-particle 
spectrum is subject to mechanisms of size quantization. This leads to the formation of “minibands” 
(indicated by shaded areas in the figure), structureless in the transverse direction and extended in 
the longitudinal direction. 

The one-dimensional fermion system exhibits a number of features not shared by higher- 
dimensional systems. The origin of these peculiarities can be easily understood from a simple 
qualitative picture. Consider an array of interacting fermions confined to a line. To optimize 
their energy the electrons can merely “push” each other around, thereby creating density 
fluctuations. By contrast, in higher-dimensional systems, electrons are free to avoid contact 
by moving around each other. A slightly different formulation of the same picture can be 
given in momentum space. The Fermi “sphere” of the one-dimensional system is defined 
through the interval $[-k_F, k_F]$ of filled momentum states. The Fermi “surface” consists
of two isolated points, $\{k_F, -k_F\}$ (see the figure below). By contrast, higher-dimensional
systems typically exhibit continuous and simply connected Fermi surfaces. It takes little 
imagination to anticipate that an extended Fermi sphere provides more phase space to 
two-particle interaction processes than the two isolated Fermi energy sectors of the one- 
dimensional system. The one-dimensional electron system represents a rare exception of an 
interacting system that can be solved under no more than a few, physically weak, simplifying 
assumptions. This makes it a precious test system on which non-perturbative quantum 
manifestations of many-body interactions can be explored. 

Quantitative analysis 

We now proceed to develop a quantitative picture of the charge density excitations of 
the one-dimensional electron system. Anticipating that, at low temperatures, the relevant 
dynamics will take place in the vicinity of the two Fermi points $\{k_F, -k_F\}$, the Hamiltonian
(2.33) can be reduced further to an effective model describing the propagation of left and
right moving excitations. To this end, we first introduce the notation that the subscripts
R/L indicate that an operator $a^\dagger_{(+/-)k_F+q}$ creates an electron that moves to the right/left
with velocity $\simeq v_F \equiv k_F/m$.

We next observe (see the figure below) that, in the immediate vicinity of the Fermi points, 
the dispersion relation is approximately linear, implying that the non-interacting part of 











the Hamiltonian can be represented as (exercise)

$$\hat H_0 = \sum_{s=R,L}\sum_q a^\dagger_{sq}\,\sigma_s v_F q\, a_{sq}, \qquad (2.34)$$

where $\sigma_s = (+/-)$ for $s = R/L$ and the summation over $q$ is restricted by some momentum
cut-off $|q| < \Gamma$ beyond which the linearization of the dispersion is invalid. (Throughout this
section, all momentum summations will be subject to this constraint.) Turning to the
interacting part of the Hamiltonian, let us first define the operator

$$\hat\rho_{sq} \equiv \sum_k a^\dagger_{sk+q} a_{sk}. \qquad (2.35)$$

Crucially, the definition of these operators is not just motivated by notational convenience. It is
straightforward to verify (exercise) that $\hat\rho_{sq}$ is obtained from the Fourier transform of the
local density operator $\hat\rho(x)$. In other words, $\hat\rho_{sq}$ measures density fluctuations of characteristic
wavelength $q^{-1}$ supported by electron excitations with characteristic momentum $\pm k_F$
(see Fig. 2.9 (a)). From our heuristic argument above, suggesting charge density modulations
to be the basic excitations of the system, we expect the operators $\hat\rho_{sq}$ to represent the
central degrees of freedom of the theory.
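
The linearization underlying Eq. (2.34) can be made quantitative in a few lines. The sketch below is an added illustration with hypothetical function names, assuming the free dispersion $k^2/2m - E_F$ with $E_F = k_F^2/2m$ and units in which $k_F = m = 1$ by default. It compares the exact energy at momentum $\sigma_s k_F + q$ with the linear form $\sigma_s v_F q$.

```python
def exact_energy(k, k_f, m):
    """Free-particle energy measured from the Fermi energy, k^2/2m - E_F."""
    return (k * k - k_f * k_f) / (2 * m)


def linear_energy(q, sigma, k_f, m):
    """Linearized dispersion sigma * v_F * q with v_F = k_F / m."""
    return sigma * (k_f / m) * q


def linearization_error(q, sigma, k_f=1.0, m=1.0):
    """Difference between the exact and the linearized energy at momentum sigma*k_F + q."""
    return exact_energy(sigma * k_f + q, k_f, m) - linear_energy(q, sigma, k_f, m)


for q in (0.01, 0.1, 0.5):
    print(q, linearization_error(q, +1), linearization_error(q, -1))
```

The deviation equals $q^2/2m$ for both branches, so the linear form is reliable only for $|q| \ll k_F$; this is the role played by the momentum cut-off.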

Represented in terms of the density operators, the interaction contribution to the Hamiltonian
may be recast as

$$\hat V_{ee} = \frac{1}{2L}\sum_{kk'q} V_{ee}(q)\, a^\dagger_{k-q}a^\dagger_{k'+q}a_{k'}a_k \equiv \frac{1}{2L}\sum_{qs}\left[g_4\hat\rho_{sq}\hat\rho_{s-q} + g_2\hat\rho_{sq}\hat\rho_{\bar s -q}\right], \qquad (2.36)$$

where $\bar s = L/R$ denotes the complement of $s = R/L$, and the constants $g_2$ and $g_4$ measure
the strength of the interaction in the vicinity of the Fermi points, i.e. where $q \simeq 0$ and
$q \simeq 2k_F$. (With the notation $g_{2,4}$ we follow a common convention in the literature.)

EXERCISE Explore the relation between the coupling constants $g_2$, $g_4$ and the Fourier transform
of $V_{ee}$. Show that to the summation $\sum_{kk'q}$, not only terms with $(k, k', q) \simeq (\pm k_F, \pm k_F, 0)$, but
also terms with $(k, k', q) \simeq (\pm k_F, \mp k_F, 2k_F)$ contribute. When adequately ordered (do it!), these
contributions can be arranged into the form of the right-hand side of Eq. (2.36). (For a detailed
discussion see, e.g., T. Giamarchi, Quantum Physics in One Dimension (Oxford University Press,
2004) or G. Mahan, Many Particle Physics (Plenum Press, 1981)). At any rate, the only point
that matters for our present discussion is that the interaction can be represented through density
operators with positive constants $g_{2,4}$ determined by the interaction strength.




Working with second quantized theories, one frequently needs to compute commutators of
operators $\hat A(a, a^\dagger)$ polynomial in the elementary boson/fermion operators of the theory (e.g. $\hat A = aa^\dagger$,
$\hat A = aaa^\dagger a^\dagger$, etc., where we have omitted the quantum number subscripts generally carried
by $a$ and $a^\dagger$). Such types of operation are made easier by a number of operations of elementary








Figure 2.9 Two different interpretations of the excitations created by the density operators $\hat\rho_{sq}$.
(a) Real space; $\hat\rho_{sq}$ creates density modulations of characteristic wavelength $q^{-1}$ and characteristic
velocity $v_F$. (b) Momentum space; application of $\hat\rho_{sq}$ to the ground state excites electrons from
states $k$ to $k + q$. This creates particle-hole excitations of energy $\epsilon_{k+q} - \epsilon_k = v_F q$ independent
of the particle/hole momentum $k$. Both particles and holes forming the excitation travel with the
same velocity $v_F$, implying that the excitation does not disperse (i.e. decay).



commutator algebra. The most basic identity, from which all sorts of other formulae can be
generated recursively, is the following:

$$[\hat A, \hat B\hat C]_\pm = [\hat A, \hat B]_\pm\hat C \mp \hat B[\hat A, \hat C]_-. \qquad (2.37)$$

Iteration of this equation for boson operators $a$, $a^\dagger$ shows that

$$[a^\dagger, a^n] = -n a^{n-1}. \qquad (2.38)$$

(Due to the fact that $a^2 = 0$ in the fermionic case, there is no fermion analog of this equation.)
Taylor expansion then shows that, for any analytic function $F(a)$, $[a^\dagger, F(a)] = -F'(a)$. Similarly,
another useful formula which follows from the above is the relation $a^\dagger F(aa^\dagger) = F(a^\dagger a)a^\dagger$, which
is also verified by series expansion.
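
Both identities lend themselves to a quick numerical test. The sketch below is an added illustration with hypothetical helper names: the product rule (2.37) is checked exactly on random integer matrices, and (2.38) is checked on boson matrices truncated to a finite Fock space. The truncation to `dim` levels is an assumption of the sketch; it distorts only the last column of the commutator, which is therefore left out of the comparison.

```python
import math
import random


def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def bracket(A, B, sign):
    """[A, B]_sign = AB + sign * BA (sign=-1: commutator, sign=+1: anticommutator)."""
    AB, BA = matmul(A, B), matmul(B, A)
    return [[x + sign * y for x, y in zip(r1, r2)] for r1, r2 in zip(AB, BA)]


def product_rule_holds(dim=3, seed=1):
    """Check [A, BC]_+/- = [A, B]_+/- C -/+ B [A, C]_- on random integer matrices."""
    rng = random.Random(seed)
    A, B, C = ([[rng.randint(-3, 3) for _ in range(dim)] for _ in range(dim)]
               for _ in range(3))
    for sign in (+1, -1):
        lhs = bracket(A, matmul(B, C), sign)
        first = matmul(bracket(A, B, sign), C)
        second = matmul(B, bracket(A, C, -1))
        rhs = [[x - sign * y for x, y in zip(r1, r2)] for r1, r2 in zip(first, second)]
        if lhs != rhs:
            return False
    return True


def boson_identity_error(n, dim=12):
    """Largest deviation of [a+, a^n] from -n a^(n-1) below the Fock-space cutoff."""
    a = [[math.sqrt(j) if i == j - 1 else 0.0 for j in range(dim)] for i in range(dim)]
    a_dag = [list(col) for col in zip(*a)]
    a_pow = [[float(i == j) for j in range(dim)] for i in range(dim)]   # a^0
    for _ in range(n - 1):
        a_pow = matmul(a_pow, a)                                        # a^(n-1)
    comm = bracket(a_dag, matmul(a_pow, a), -1)
    # The last column is distorted by the truncation of a+ and is excluded.
    return max(abs(comm[i][j] + n * a_pow[i][j])
               for i in range(dim) for j in range(dim - 1))


print(product_rule_holds(), boson_identity_error(3))
```

Integer matrices keep the first check free of rounding error, while the second reproduces the identity up to floating-point accuracy on all columns unaffected by the cutoff.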



So far, we have merely rewritten parts of the Hamiltonian in terms of density operators. 
Ultimately, however, we wish to arrive at a representation whereby these operators, instead 
of the original electron operators, represent the fundamental degrees of freedom of the 
theory. Since the definition of the operators $\hat\rho$ involves products of two Fermi operators, we
expect the density operators to resemble bosonic excitations. Thus, as a first and essential
step towards the construction of the new picture, we explore the commutation relations
between the operators $\hat\rho_{sq}$.

From the definition (2.35) and the auxiliary identity (2.37) it is straightforward to verify
the commutation relation $[\hat\rho_{sq}, \hat\rho_{s'q'}] = \delta_{ss'}\sum_k (a^\dagger_{sk+q}a_{sk-q'} - a^\dagger_{sk+q+q'}a_{sk})$. As it stands, this
relation is certainly not of much practical use. To make further progress, we must resort to a
(not very restrictive) approximation. Ultimately we will want to compute some observables
involving quantum averages taken on the ground state of the theory, $\langle\Omega|\ldots|\Omega\rangle$. To simplify
the structure of the theory, we may thus replace the right-hand side of the relation by its
ground state expectation value:

$$[\hat\rho_{sq}, \hat\rho_{s'q'}] \approx \delta_{ss'}\sum_k\langle\Omega|a^\dagger_{sk+q}a_{sk-q'} - a^\dagger_{sk+q+q'}a_{sk}|\Omega\rangle = \delta_{ss'}\delta_{q,-q'}\sum_k\langle\Omega|(\hat n_{sk+q} - \hat n_{sk})|\Omega\rangle,$$

where, as usual, $\hat n_{sk} = a^\dagger_{sk}a_{sk}$, and we have made use of the fact that $\langle\Omega|a^\dagger_{sk}a_{sk'}|\Omega\rangle \propto \delta_{kk'}$.
Although this is an uncontrolled approximation, it is expected to become better the closer we
stay to the zero-temperature ground state $|\Omega\rangle$ of the theory (i.e. at low excitation energies).










EXERCISE Try to critically assess the validity of the approximation. (For a comprehensive 
discussion, see the text by Giamarchi. 17 ) 

At first glance, it would seem that the right-hand side of our simplified commutator relation
actually vanishes. A simple shift of the summation index, $\sum_k\langle\Omega|\hat n_{sk+q}|\Omega\rangle \stackrel{?}{=} \sum_k\langle\Omega|\hat n_{sk}|\Omega\rangle$,
indicates that the two terms contributing to the sum cancel. However, this argument is
certainly too naive. It ignores the fact that our summation is limited by a cut-off momentum
$\Gamma$. Since the shift $k \to k - q$ changes the cut-off, the interpretation above is invalid.

To obtain a more accurate result, let us consider the case $s = R$ and $q > 0$. We know
that, in the ground state, all states with momentum $k < 0$ are occupied while all states
with $k > 0$ are empty. This implies that

$$\sum_k\langle\Omega|(\hat n_{Rk+q} - \hat n_{Rk})|\Omega\rangle = \Big(\sum_{-\Gamma < k < -q} + \sum_{-q < k < 0} + \sum_{0 < k < \Gamma}\Big)\langle\Omega|(\hat n_{Rk+q} - \hat n_{Rk})|\Omega\rangle = \sum_{-q \le k \le 0}\langle\Omega|(\hat n_{Rk+q} - \hat n_{Rk})|\Omega\rangle = -\frac{qL}{2\pi},$$

where, with the last equality, we have used the fact that a momentum interval of size $q$
contains $q/(2\pi/L)$ quantized momentum states. Similar reasoning for $s = L$ shows that the
effective form of the commutator relation reads

$$[\hat\rho_{sq}, \hat\rho_{s'q'}] = -\delta_{ss'}\delta_{q,-q'}\,\sigma_s\frac{qL}{2\pi}. \qquad (2.39)$$
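
The role of the cut-off in this argument can be made explicit by counting levels. The sketch below is an added illustration with hypothetical names: momenta are written as $k = 2\pi m/L$ with integer $m$ measured from the respective Fermi point, the transfer is $q = 2\pi j/L$, the sum runs over $|m| \le$ `m_cut`, and the level $m = 0$ is counted as filled (an assumption that does not affect the result).

```python
def occupation(m, sigma):
    """Ground-state occupation of the level with momentum 2*pi*m/L measured from sigma*k_F.

    sigma = +1 (right movers): levels with m <= 0 are filled;
    sigma = -1 (left movers):  levels with m >= 0 are filled.
    """
    return 1 if sigma * m <= 0 else 0


def cutoff_sum(j, m_cut, sigma=+1):
    """Sum over |m| <= m_cut of <n_{m+j} - n_m>, where q = 2*pi*j/L."""
    return sum(occupation(m + j, sigma) - occupation(m, sigma)
               for m in range(-m_cut, m_cut + 1))


for j in (1, 2, 5):
    print(j, cutoff_sum(j, m_cut=100, sigma=+1), cutoff_sum(j, m_cut=100, sigma=-1))
```

Because the summation window is fixed while the argument of the first occupation number is shifted, the two terms do not cancel: the sum equals $-\sigma_s j = -\sigma_s qL/2\pi$ independently of the cut-off, as long as $|j|$ does not exceed it.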

Now, if it were not for the $q$-dependence of the right-hand side of this relation, we would
indeed have found (approximate) bosonic commutation relations. Therefore, to make the
connection to bosons explicit, let us define

$$b_q \equiv n_q\hat\rho_{Lq}, \quad b_q^\dagger \equiv n_q\hat\rho_{L(-q)}, \quad b_{-q} \equiv n_q\hat\rho_{R(-q)}, \quad b_{-q}^\dagger \equiv n_q\hat\rho_{Rq}, \qquad (2.40)$$

where $q > 0$ and $n_q \equiv (2\pi/Lq)^{1/2}$. It is easily confirmed that the newly defined operators $b_q$
obey canonical commutation relations (exercise), i.e. we have indeed found that, apart from
the scaling factors $n_q$, the density excitations of the system behave as bosonic “particles.”
Expressed in terms of the operators $b$, the interaction part of the Hamiltonian takes the
form (exercise)

$$\hat V_{ee} = \frac{1}{2\pi}\sum_{q>0} q\,\begin{pmatrix} b_q^\dagger & b_{-q}\end{pmatrix}\begin{pmatrix} g_4 & g_2 \\ g_2 & g_4\end{pmatrix}\begin{pmatrix} b_q \\ b_{-q}^\dagger\end{pmatrix}.$$




Notice that we have succeeded in representing a genuine two-body interaction, a contribution 
that usually renders a model unsolvable, in terms of a quadratic representation. However, 



17 T. Giamarchi, Quantum Physics in One Dimension (Oxford University Press, 2004).







the free boson representation of the interaction term will be of little use until the kinetic
part of the Hamiltonian $\hat H_0$ is represented in terms of the $b$ operators. There are various
ways of achieving this goal. The most straightforward route, a direct construction of a
representation of $\hat H_0$ in terms of the Bose operators, is cumbersome in practice. However,
there exists a more efficient way that is based on indirect reasoning. As follows from the
discussion of Section 2.1, the properties of second quantized operators are fixed by their
commutation relations. 18 So what we are going to do is search for an operator $\hat H_0'(b, b^\dagger)$
that has the same commutation relations with the boson operators $(b, b^\dagger)$ as the original
kinetic energy operator $\hat H_0(a, a^\dagger)$. Using Eq. (2.34), the definition (2.35), and the auxiliary
identity (2.37), it is straightforward to verify that $[\hat H_0, \hat\rho_{sq}] = q v_F\sigma_s\hat\rho_{sq}$. On the other hand,
using Eq. (2.39) one finds that the same commutation relations hold with the operator

$$\hat H_0' = \frac{\pi v_F}{L}\sum_{qs}\hat\rho_{sq}\hat\rho_{s-q},$$

i.e. $[\hat H_0', \hat\rho_{sq}] = q v_F\sigma_s\hat\rho_{sq}$. Following the logic of our argument we thus identify $\hat H_0 = \hat H_0'$ (up
to inessential constants) and substitute $\hat H_0'$ for the non-interacting Hamiltonian.

EXERCISE To gain some confidence in the identification $\hat H_0 = \hat H_0' + \text{const.}$, and to show that the undetermined constant actually equals zero, compute the energy expectation value of the state $|sq\rangle \equiv \hat\rho_{sq}|\Omega\rangle$ both as $\langle sq|\hat H_0|sq\rangle$ and as $\langle sq|\hat H_0'|sq\rangle$. Confirm that the two expressions coincide.



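Finally, using Eq. (2.40) and adding the interaction contribution $\hat V_{ee}$ we arrive at the effective Hamiltonian

$$\hat H = \sum_{q>0} q\,\bigl(b_q,\; b_{-q}^\dagger\bigr)\begin{pmatrix} v_F + \frac{g_4}{2\pi} & \frac{g_2}{2\pi}\\ \frac{g_2}{2\pi} & v_F + \frac{g_4}{2\pi}\end{pmatrix}\begin{pmatrix} b_q^\dagger\\ b_{-q}\end{pmatrix}. \qquad (2.41)$$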



We have thus succeeded in mapping the full interacting problem onto a free bosonic theory. The mapping $a \to \rho \to b$ is our first example of a technique known as bosonization. Such techniques play an important role in 2 (= 1 space + 1 time)-dimensional field theory in general. More sophisticated bosonization schemes will be discussed in Sections 4.3 and 9.4.4. Conversely, it is sometimes useful to represent a boson problem in terms of fermions via fermionization. One may wonder why it is possible to represent the low-lying excitations of a gas of fermions in terms of bosons so effortlessly. Fermi-Bose transmutability is indeed a peculiarity of one-dimensional quantum systems. Particles confined to a line cannot pass "around" each other. That means that the whole issue of sign factors arising from the interchange of particle coordinates does not arise, and much of the exclusion-type



18 This argument can be made quantitative by group theoretical reasoning: Eq. (2.4) and (2.7) define the irreducible representation of an operator algebra. It is an algebra because $[\,,\,]$ defines a product in the space of generators $\{a_\lambda, a_\lambda^\dagger\}$, a representation because the operators act in a vector space (namely Fock space $\mathcal F$), and irreducible because all states $|\lambda_1, \ldots, \lambda_N\rangle \in \mathcal F$ can be reached by iterative application of operators onto a unique reference state (e.g. $|\Omega\rangle$). Under these conditions, Schur's lemma (to be discussed in more detail in Chapter 4) states that two operators $\hat A_1$ and $\hat A_2$ having identical commutation relations with all $\{a_\lambda, a_\lambda^\dagger\}$ are equal up to a constant.







characteristics of the Fermi system are inactivated. A more systematic formulation of Fermi $\leftrightarrow$ Bose transformations will be discussed in Chapter 4.

Now, there is one last problem that needs to be overcome to actually solve the interacting problem. In Chapter 1, we learned how to interpret Hamiltonians of the structure $\sum_q b_q^\dagger b_q$ as superpositions of harmonic oscillators. However, in our present problem, terms of the type $b_q b_{-q}$ and $b_q^\dagger b_{-q}^\dagger$ appear. To return to familiar terrain, we need to eliminate these terms. However, before doing so, it is instructive to discuss the physical meaning of the problem.

Firstly, let us recall that the total number operator of a theory described by operators $b_\lambda^\dagger, b_\lambda$ is given by $\hat N = \sum_\lambda b_\lambda^\dagger b_\lambda$. Now, if the Hamiltonian has the form $\hat H = \sum_{\mu\nu} b_\mu^\dagger H_{\mu\nu} b_\nu$, the total number operator commutes with $\hat H$, i.e. $[\hat N, \hat H] = 0$ (exercise). This means that $\hat H$ and $\hat N$ can be simultaneously diagonalized, or, in more physical terms, that the Hamiltonian enjoys the feature of particle number conservation. More generally, any Hamiltonian in which operators appear as polynomials containing equal numbers of creation and annihilation operators (e.g. $b^\dagger b^\dagger b b$, $b^\dagger b^\dagger b^\dagger b b b$, etc.) has this property. This is because any operator of this structure creates as many particles as it annihilates. In problems where the total number of particles is conserved (e.g. the theory of interacting electrons in an isolated piece of metal), the Hamiltonian is bound to have this structure. Conversely, in situations where the number of excitations is not fixed (e.g. a theory of photons or phonons) particle number violating terms like $bb$ or $b^\dagger b^\dagger$ can appear. Such a situation is realized in our present problem; the number of density excitations in an electron system is certainly not a conserved quantity, which explains why contributions like $b_q b_{-q}$ appear in $\hat H$.
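The statement about particle number conservation can be checked numerically. The following is a minimal sketch, assuming two bosonic modes truncated to a finite occupation (the truncation, the helper names `boson_ops` and `commutator`, and the numerical values of the matrix `h` are illustrative choices, not taken from the text). It compares a Hamiltonian of the form $\sum_{\mu\nu} b_\mu^\dagger H_{\mu\nu} b_\nu$ with a pair term of the type $bb + b^\dagger b^\dagger$.

```python
import numpy as np

def boson_ops(n_modes, n_max):
    """Annihilation operators for n_modes bosonic modes,
    each truncated to occupations 0..n_max (an approximation)."""
    d = n_max + 1
    a = np.diag(np.sqrt(np.arange(1, d)), k=1)  # a|n> = sqrt(n)|n-1>
    ops = []
    for i in range(n_modes):
        factors = [np.eye(d)] * n_modes
        factors[i] = a
        full = factors[0]
        for f in factors[1:]:
            full = np.kron(full, f)
        ops.append(full)
    return ops

def commutator(x, y):
    return x @ y - y @ x

b = boson_ops(n_modes=2, n_max=3)
# Total number operator N = sum_l b_l^dagger b_l (matrices are real, so .T is the adjoint)
N_op = sum(op.T @ op for op in b)

# Arbitrary Hermitian coefficient matrix (assumed values, for illustration only)
h = np.array([[1.0, 0.3], [0.3, 2.0]])
H_conserving = sum(h[i, j] * b[i].T @ b[j] for i in range(2) for j in range(2))

# Pair creation/annihilation term of the type b b + b^dagger b^dagger
H_pair = b[0] @ b[1] + b[1].T @ b[0].T

conserves = np.allclose(commutator(N_op, H_conserving), 0.0)
violates = not np.allclose(commutator(N_op, H_pair), 0.0)
print("number-conserving form commutes with N:", conserves)
print("pair term fails to commute with N:", violates)
```

The truncation does not spoil the first check, because $b_\mu^\dagger b_\nu$ never changes the total occupation.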

To eliminate the non-particle-number-conserving contributions we should, somehow, transform the matrix

$$K \equiv \begin{pmatrix} v_F + \frac{g_4}{2\pi} & \frac{g_2}{2\pi}\\ \frac{g_2}{2\pi} & v_F + \frac{g_4}{2\pi}\end{pmatrix}$$

to a diagonal structure. Transformations of $K$ can be generated by transforming the operators $b_q$ and $b_q^\dagger$ to a different representation. Specifically, with $\Psi_q \equiv (b_q^\dagger, b_{-q})^T$, we may define $\Psi_q' \equiv T^{-1}\Psi_q$, where $T$ is a 2 × 2 matrix acting on the two components of $\Psi$. (Since $K$ does not depend on $q$, $T$ can be chosen to have the same property.) After the transformation, the Hamiltonian will have the form

$$\hat H = \sum_{q>0} q\,\Psi_q^\dagger K \Psi_q = \sum_{q>0} q\,\Psi_q'^\dagger\,\underbrace{T^\dagger K T}_{K'}\,\Psi_q', \qquad (2.42)$$

with a new matrix $K' \equiv T^\dagger K T$. We will seek a transformation $T$ that makes $K'$ diagonal. However, an important point to be kept in mind is that not all 2 × 2 matrices $T$ qualify as transformations. We must ensure that the transformed "vector" again has the structure $\Psi_q' \equiv (b_q'^\dagger, b_{-q}')^T$, with a boson creation/annihilation operator in the first/second component, i.e. the commutation relations of the operators must be conserved by the transformation. Remembering that the algebraic properties of the operators $b$ are specified through commutation relations, this condition can be cast in mathematical form by requiring that the commutator $[\Psi_{qi}, \Psi_{qj}^\dagger] = (-\sigma_3)_{ij} = [\Psi_{qi}', \Psi_{qj}'^\dagger]$ be invariant under the transformation. Using the fact that $\Psi' = T^{-1}\Psi$, this condition is seen to be equivalent to the pseudo-unitarity condition $T^\dagger \sigma_3 T = \sigma_3$.







With this background, we are now in a position to find a transformation that brings the matrix $K'$ to a 2 × 2 diagonal form. Multiplication of the definition $K' = T^\dagger K T$ by $\sigma_3$ leads to

$$T^\dagger K T = K' \quad\Leftrightarrow\quad \underbrace{\sigma_3 T^\dagger \sigma_3}_{T^{-1}}\,\sigma_3 K\,T = \sigma_3 K'.$$

This means that the matrix $\sigma_3 K'$ is obtained by a similarity transformation $T^{-1}(\cdots)T$ from the matrix $\sigma_3 K$, or, in other words, that the matrix $\sigma_3 K'$ contains the eigenvalues $\pm v_\rho$ of $\sigma_3 K$ on its diagonal. (That the eigenvalues sum to 0 follows from the fact that the trace vanishes, $\mathrm{tr}(\sigma_3 K) = 0$.) However, the eigenvalues of $\sigma_3 K$ are readily computed as

$$v_\rho = \frac{1}{2\pi}\left[(2\pi v_F + g_4)^2 - g_2^2\right]^{1/2}. \qquad (2.43)$$

Thus, with $\sigma_3 K' = \sigma_3 v_\rho$ we arrive at $K' = v_\rho \cdot \mathrm{id.}$, where "id." stands for the unit matrix (footnote 19). Substitution of this result into Eq. (2.42) finally leads to the diagonal Hamiltonian $\hat H = v_\rho \sum_{q>0} q\,\Psi_q'^\dagger \Psi_q'$ or equivalently, making use of the identity $\Psi_q'^\dagger \Psi_q' = b_q'^\dagger b_q' + b_{-q}'^\dagger b_{-q}' + 1$,

$$\hat H = v_\rho \sum_q |q|\, b_q^\dagger b_q. \qquad (2.44)$$

Here we have ignored an overall constant and omitted the prime on our new Bose operators.
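The diagonalization can be verified numerically for a given set of couplings. The sketch below assumes the hyperbolic parameterization of $T$ quoted in footnote 19 and a stable parameter regime $|g_2| < 2\pi v_F + g_4$. The function name `bogoliubov` and the numerical values are illustrative assumptions.

```python
import numpy as np

def bogoliubov(v_F, g2, g4):
    """Return K, a pseudo-unitary T, and v_rho for the 2x2 problem.
    Assumes |g2| < 2*pi*v_F + g4 so that v_rho is real."""
    a = v_F + g4 / (2 * np.pi)
    c = g2 / (2 * np.pi)
    K = np.array([[a, c], [c, a]])
    # tanh(2 theta) = -g2 / (2 pi v_F + g4), as in footnote 19
    theta = 0.5 * np.arctanh(-g2 / (2 * np.pi * v_F + g4))
    T = np.array([[np.cosh(theta), np.sinh(theta)],
                  [np.sinh(theta), np.cosh(theta)]])
    v_rho = np.sqrt((2 * np.pi * v_F + g4) ** 2 - g2 ** 2) / (2 * np.pi)
    return K, T, v_rho

sigma3 = np.diag([1.0, -1.0])
K, T, v_rho = bogoliubov(v_F=1.0, g2=2.0, g4=1.0)  # illustrative values

K_prime = T.T @ K @ T                       # T is real, so T^dagger = T^T
pseudo_unitary = np.allclose(T.T @ sigma3 @ T, sigma3)
diagonal = np.allclose(K_prime, v_rho * np.eye(2))
spectrum = np.sort(np.linalg.eigvals(sigma3 @ K).real)

print("T^dagger sigma3 T = sigma3:", pseudo_unitary)
print("K' = v_rho * id:", diagonal)
print("eigenvalues of sigma3 K:", spectrum, " v_rho:", v_rho)
```

Note that $T$ is not unitary; it is the invariance of $\sigma_3$, rather than of the unit matrix, that preserves the bosonic commutation relations.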

In the literature, the transformation procedure outlined above is known as a Bogoliubov transformation. Transformations of this type are frequently applied in quantum magnetism (see below), superconductivity, or, more generally, all problems where the particle number is not conserved. Notice that the possibility to transform to a representation $\sim b^\dagger b$ does not imply that miraculously the theory has become particle number conserving. The new "quasi-particle" operators $b$ are related to the original Bose operators through a transformation that mixes $b$ and $b^\dagger$. While the quasi-particle number is conserved, the number of original density excitations is not.

Equations (2.43) and (2.44) represent our final solution of the problem of spinless interacting fermions in one dimension. We have succeeded in mapping the problem onto a form analogous to our previous results (1.34) and (1.39) for the phonon and the photon system, respectively. Indeed, all that has been said about those Hamiltonians applies equally to Eq. (2.44): the basic elementary excitations of the one-dimensional fermion system are waves, i.e. excitations with linear dispersion $\omega = v_\rho |q|$. In the present context, they are



Nicolai Nikolaevich Bogoliubov 1909-92 

A theoretical physicist acclaimed for his works in 
nonlinear mechanics, statistical physics, theory of 
superfluidity and superconductivity, quantum field 
theory, renormalization group theory, proof of dis- 
persion relations, and elementary particle theory. 



19 Explicit knowledge of the transformation matrix $T$, i.e. knowledge of the relation between the operators $b$ and $b'$, is not needed for our construction. However, for the sake of completeness, we mention that

$$T = \begin{pmatrix}\cosh\theta_k & \sinh\theta_k\\ \sinh\theta_k & \cosh\theta_k\end{pmatrix},$$

with $\tanh(2\theta) = -g_2/(2\pi v_F + g_4)$, represents a suitable parameterization.








termed charge density waves (CDW). The Bose creation operators describing these excitations are, up to the Bogoliubov transformation and a momentum dependent scaling factor $(2\pi/Lq)^{1/2}$, equivalent to the density operators of the electron gas. For a non-interacting system, $g_2 = g_4 = 0$, and the CDW propagates with the velocity of the free Fermi particles, $v_F$. A fictitious interaction that does not couple particles of opposite Fermi momentum, $g_2 = 0$, $g_4 \neq 0$, speeds up the CDW. Heuristically, this can be interpreted as an "acceleration process" whereby a CDW pushes its own charge front. By contrast, interactions between left and right movers, $g_2 \neq 0$, diminish the velocity, i.e. due to the Coulomb interaction it is difficult for distortions of opposite velocities to penetrate each other. (Notice that, for a theory with $g_2 = 0$, no Bogoliubov transformation would be needed to diagonalize the Hamiltonian, i.e. in this case, undisturbed left- and right-moving waves would be the basic excitations of the theory.)
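The three regimes just described can be read off directly from Eq. (2.43). As a small illustration (the function name `cdw_velocity` and the coupling values are assumptions chosen for the example, with a repulsive $g_4 > 0$):

```python
import math

def cdw_velocity(v_F, g2, g4):
    """Charge density wave velocity, Eq. (2.43)."""
    return math.sqrt((2 * math.pi * v_F + g4) ** 2 - g2 ** 2) / (2 * math.pi)

v_free = cdw_velocity(1.0, g2=0.0, g4=0.0)   # non-interacting: v_rho = v_F
v_g4 = cdw_velocity(1.0, g2=0.0, g4=1.0)     # only g4: v_rho = v_F + g4/(2 pi)
v_g2 = cdw_velocity(1.0, g2=1.0, g4=0.0)     # only g2: velocity is reduced

print(v_free, v_g4, v_g2)
```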

Our discussion above neglected the spin carried by the conduction electrons. Had we included the electron spin, the following picture would have emerged (see Problem 2.4): the long-range dynamics of the electron gas is governed by two independently propagating wave modes, the charge density wave discussed above, and a spin density wave (SDW) (footnote 20). The SDW carries a spin current, but is electrically neutral. As with the CDW, its dispersion is linear with an interaction-renormalized velocity, $v_s$ (which, however, is generally larger than the velocity $v_\rho$ of the CDW). To understand the consequences of this phenomenon, imagine an electron had been thrown into the system (e.g. by attaching a tunnel contact somewhere along the wire). As discussed above, a single electron does not represent a stable excitation of the one-dimensional electron gas. What will happen is that the spectral weight of the particle (footnote 21) disintegrates into a collective charge excitation and a spin excitation. The newly excited waves then propagate into the bulk of the system at different velocities $\pm v_\rho$ and $\pm v_s$. In other words, the charge and the spin of the electron effectively "disintegrate" into two separate excitations, a phenomenon known as spin-charge separation. Spin-charge separation in one-dimensional metals exemplifies a mechanism frequently observed in condensed matter systems: the set of quantum numbers carried by elementary particles may get effectively absorbed by different excitation channels. One of the more spectacular manifestations of this effect is the appearance of fractionally charged excitations in quantum Hall systems, to be discussed in more detail in Chapter 9.

The theory of spin and charge density waves in one-dimensional conductors has a long 
history spanning four decades. However, despite the rigor of the theory its experimental 
verification has proved excruciatingly difficult! While various experiments are consistent 
with theory (for a review, see Ref. 17), only recently have signatures of spin and charge
density wave excitations been experimentally observed. 



20 One may think of the charge density of the electron gas $\rho = \rho_\uparrow + \rho_\downarrow$ as the sum of the densities of the spin up and spin down populations, respectively. The local spin density is then given by $\rho_s \equiv \rho_\uparrow - \rho_\downarrow$. After what has been said above, it is perhaps not too surprising that fluctuations of these two quantities represent the dominant excitations of the electron gas. What is surprising, though, is that these two excitations do not interact with each other.

21 For a precise definition of this term, see Chapter 7.







Quantum spin chains 

In the previous section, the emphasis was placed on charging effects generated by Coulomb interaction. However, as we have seen in Section 2.2, Coulomb interaction may also lead to the indirect generation of magnetic interactions. In one dimension, one can account for these mechanisms by adding to our previously structureless electrons a spin degree of freedom. This leads to the Tomonaga-Luttinger liquid, a system governed by the coexistence of collective spin and charge excitations. However, to introduce the phenomena brought about by quantum magnetic correlations, it is best to first consider systems where the charge degrees of freedom are frozen and only spin excitations remain. Such systems are realized, for example, in Mott insulators where interaction between the spins of localized electrons is mediated by virtual exchange processes between neighboring electrons. One can describe these correlations through models of localized quantum spins, either in chains or, more generally, in higher-dimensional quantum spin lattices. We begin our discussion with the ferromagnetic spin chain.



Werner Heisenberg 1901-76

Nobel Laureate in Physics in 1932 "for the creation of quantum mechanics, the application of which has, inter alia, led to the discovery of the allotropic forms of hydrogen." In 1927 he published his uncertainty principle, for which he is perhaps best known. He also made important contributions to the theories of hydrodynamics of turbulent flows, ferromagnetism, cosmic rays, and subatomic particles, and he was instrumental in planning the first West German nuclear reactor at Karlsruhe, together with a research reactor in Munich, in 1957. (Image © The Nobel Foundation.)



Quantum ferromagnet 

The quantum Heisenberg ferromagnet is specified by the Hamiltonian

$$\hat H = -J\sum_{\langle mn\rangle}\hat{\mathbf S}_m\cdot\hat{\mathbf S}_n, \qquad (2.45)$$

where $J > 0$, $\hat{\mathbf S}_m$ represents the quantum mechanical spin operator at lattice site $m$, and, as before, $\langle mn\rangle$ denotes summation over neighboring sites. In Section 2.1 (see Eq. (2.13)) the quantum mechanical spin was represented through an electron basis. However, one can conceive of situations where the spin sitting at site $m$ is carried by a different object (e.g. an atom with non-vanishing magnetic moment). At any rate, for the purposes of our present discussion, we need not specify the microscopic origin of the spin. All we need to know is (i) that the lattice operators $\hat S_m^i$ obey the SU(2) commutator algebra,

$$[\hat S_m^i, \hat S_n^j] = i\,\delta_{mn}\,\epsilon^{ijk}\,\hat S_n^k, \qquad (2.46)$$

characteristic of quantum spins, and (ii) that the total spin at each lattice site is $S$ (footnote 22).



22 Remember that the finite-dimensional representations of the spin operator are of dimension $2S + 1$ where $S$ may be integer or half integer. While a single electron has spin $S = 1/2$, the total magnetic moment of electrons bound to an atom may be much larger.









Figure 2.10 Showing the spin configuration of an elementary spin-wave excitation from the spin polarized ground state.

Now, due to the positivity of the coupling constant $J$, the Hamiltonian favors configurations where the spins at neighboring sites are aligned in the same direction (cf. Fig. 2.10). A ground state of the system is given by $|\Omega\rangle \equiv \otimes_m |S_m\rangle$, where $|S_m\rangle$ represents a state with maximal spin-$z$ component: $\hat S_m^z|S_m\rangle = S|S_m\rangle$. We have written "a" ground state instead of "the" ground state because the system is highly degenerate: a simultaneous change of the orientation of all spins does not change the ground state energy, i.e. the system possesses a global rotation symmetry.

EXERCISE Compute the energy expectation value of the state $|\Omega\rangle$. Defining global spin operators through $\hat S^i \equiv \sum_m \hat S_m^i$, consider the state $|\boldsymbol\alpha\rangle \equiv \exp(i\boldsymbol\alpha\cdot\hat{\mathbf S})|\Omega\rangle$. Verify that the state $|\boldsymbol\alpha\rangle$ is degenerate with $|\Omega\rangle$. Explicitly compute the state $|(\pi/2, 0, 0)\rangle$. Convince yourself that, for general $\boldsymbol\alpha$, $|\boldsymbol\alpha\rangle$ can be interpreted as a state with rotated quantization axis.
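For a small system the first part of the exercise can be cross-checked by brute force. The following sketch, assuming a periodic chain of $N = 6$ spins with $S = 1/2$ and $J = 1$ (all illustrative choices, as are the helper names `site_op` and `heisenberg_chain`), builds the Hamiltonian as an explicit matrix, evaluates the energy of the fully polarized state, and counts the degeneracy of the lowest level.

```python
import numpy as np

# Spin-1/2 operators (hbar = 1)
sx = np.array([[0, 1], [1, 0]]) / 2
sy = np.array([[0, -1j], [1j, 0]]) / 2
sz = np.array([[1, 0], [0, -1]]) / 2

def site_op(op, m, N):
    """Embed a single-site operator at site m of an N-site chain."""
    out = np.array([[1.0]])
    for j in range(N):
        out = np.kron(out, op if j == m else np.eye(2))
    return out

def heisenberg_chain(N, J=1.0):
    """H = -J sum_m S_m . S_{m+1} with periodic boundary conditions."""
    H = np.zeros((2 ** N, 2 ** N), dtype=complex)
    for m in range(N):
        n = (m + 1) % N
        for s in (sx, sy, sz):
            H -= J * site_op(s, m, N) @ site_op(s, n, N)
    return H

N, J, S = 6, 1.0, 0.5
H = heisenberg_chain(N, J)
energies = np.linalg.eigvalsh(H)

polarized = np.zeros(2 ** N)
polarized[0] = 1.0                     # all spins in the state S^z = +1/2
E_polarized = (polarized @ H @ polarized).real
degeneracy = int(np.sum(np.isclose(energies, energies[0])))

print("energy of polarized state:", E_polarized, " expected -J N S^2 =", -J * N * S ** 2)
print("lowest eigenvalue:", energies[0], " degeneracy:", degeneracy)
```

The degeneracy found this way reflects the multiplet of states obtained by globally rotating the polarized configuration.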

As with our previous examples, we expect that a global continuous symmetry will involve the presence of energetically low-lying excitations. Indeed, it is obvious that, in the limit of long wavelength $\lambda$, a weak distortion of a ground state configuration (see Fig. 2.10) will cost vanishingly small energy. To quantitatively explore the physics of these spin waves, we adopt a "semiclassical" picture, where the spin $S \gg 1$ is assumed to be large. In this limit, the rotation of the spins around the ground state configuration becomes similar to the rotation of a classical magnetic moment.

INFO To better understand the mechanism behind the semi-classical approximation, consider the Heisenberg uncertainty relation, $\Delta S^i\,\Delta S^j \sim |\langle[\hat S^i, \hat S^j]\rangle| = \epsilon^{ijk}|\langle \hat S^k\rangle|$, where $\Delta S^i$ is the root mean square of the quantum uncertainty of spin component $i$. Using the fact that $|\langle \hat S^k\rangle| \leq S$, we obtain for the relative uncertainty, $\Delta S^i/S$,

$$\frac{\Delta S^i}{S}\,\frac{\Delta S^j}{S} \sim \frac{S}{S^2} \;\xrightarrow{\;S \gg 1\;}\; 0,$$

i.e., for $S \gg 1$, quantum fluctuations of the spin become less important.
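The scaling of the relative uncertainty can be made concrete for the fully polarized single-site state. The sketch below constructs standard spin-$S$ matrices (the helper names `spin_matrices` and `relative_uncertainty` are hypothetical) and evaluates $\Delta S^x/S$ in the state with $S^z = S$, for which the transverse variance equals $S/2$.

```python
import numpy as np

def spin_matrices(S):
    """Spin-S matrices S^x and S^z in the basis m = S, S-1, ..., -S."""
    m = S - np.arange(int(round(2 * S)) + 1)
    Sz = np.diag(m)
    # <m+1|S^+|m> = sqrt(S(S+1) - m(m+1)); basis ordered with decreasing m
    Sp = np.diag(np.sqrt(S * (S + 1) - m[1:] * (m[1:] + 1)), k=1)
    Sx = 0.5 * (Sp + Sp.T)
    return Sx, Sz

def relative_uncertainty(S):
    """Delta S^x / S in the state with maximal S^z."""
    Sx, _ = spin_matrices(S)
    top = np.zeros(Sx.shape[0])
    top[0] = 1.0
    mean = top @ Sx @ top
    var = top @ Sx @ Sx @ top - mean ** 2
    return np.sqrt(var) / S

for S in (0.5, 2, 10, 50):
    print(S, relative_uncertainty(S), 1 / np.sqrt(2 * S))
```

The two printed columns agree, showing that the relative fluctuation decays as $S^{-1/2}$, consistent with the product of two relative uncertainties scaling as $1/S$.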

In the limit of large spin S, and at low excitation energies, it is natural to describe the 
ordered phase in terms of small fluctuations of the spins around their expectation values 
(cf. the description of the ordered phase of a crystal in terms of small fluctuations of the 
atoms around the ordered lattice sites). These fluctuations are conveniently represented in 











Figure 2.11 Spin-wave spectrum of europium oxide as measured by inelastic neutron scattering at a reference temperature of 5.5 K. Note that, at low values of momenta $q$, the dispersion is quadratic, in agreement with the low-energy theory. (Exercise: A closer inspection of the data shows the existence of a small gap in the spectrum at $q = 0$. To what may this gap be attributed?) Figure reprinted with permission from L. Passell, O. W. Dietrich, and J. Als-Nielsen, Neutron scattering from the Heisenberg ferromagnets EuO and EuS I: the exchange interaction, Phys. Rev. B 14 (1976), 4897-907. Copyright (1976) by the American Physical Society.



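terms of spin raising and lowering operators: with $\hat S^\pm \equiv \hat S^x \pm i\hat S^y$ it is straightforward to verify that

$$[\hat S_m^z, \hat S_n^\pm] = \pm\delta_{mn}\,\hat S_m^\pm, \qquad [\hat S_m^+, \hat S_n^-] = 2\delta_{mn}\,\hat S_m^z. \qquad (2.47)$$

Application of $\hat S_m^{-(+)}$ lowers (raises) the $z$-component of the spin at site $m$ by one. To actually make use of the fact that deviations around $|\Omega\rangle$ are small, a representation known as the Holstein-Primakoff transformation (footnote 23) was introduced in which the spin operators $\hat S^\pm, \hat S^z$ are specified in terms of bosonic creation and annihilation operators $a^\dagger$ and $a$:

$$\hat S_m^- = a_m^\dagger\left(2S - a_m^\dagger a_m\right)^{1/2}, \qquad \hat S_m^+ = \left(2S - a_m^\dagger a_m\right)^{1/2} a_m, \qquad \hat S_m^z = S - a_m^\dagger a_m.$$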

EXERCISE Confirm that the spin operators satisfy the commutation relations (2.47). 
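A numerical cross-check of this exercise for a single site is sketched below. It assumes that the boson Hilbert space is restricted to occupations $n = 0, \ldots, 2S$, which is the physical subspace: the square root factor vanishes at $n = 2S$, so the operators never lead out of it. The function name `holstein_primakoff` is an illustrative choice.

```python
import numpy as np

def holstein_primakoff(S):
    """Single-site Holstein-Primakoff operators on the subspace n = 0..2S."""
    dim = int(round(2 * S)) + 1
    n = np.arange(dim)
    a = np.diag(np.sqrt(n[1:]), k=1)        # a|n> = sqrt(n)|n-1>
    root = np.diag(np.sqrt(2 * S - n))      # (2S - a^dagger a)^{1/2}
    Sz = np.diag(S - n)                     # S^z = S - a^dagger a
    Sp = root @ a                           # S^+ = (2S - a^dagger a)^{1/2} a
    Sm = a.T @ root                         # S^- = a^dagger (2S - a^dagger a)^{1/2}
    return Sz, Sp, Sm

def commutator(x, y):
    return x @ y - y @ x

for S in (0.5, 1.0, 2.5):
    Sz, Sp, Sm = holstein_primakoff(S)
    ok = (np.allclose(commutator(Sp, Sm), 2 * Sz)
          and np.allclose(commutator(Sz, Sp), Sp)
          and np.allclose(commutator(Sz, Sm), -Sm))
    print("S =", S, " algebra (2.47) satisfied:", ok)
```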



The utility of this representation is clear. When the spin is large, $S \gg 1$, an expansion in powers of $1/S$ gives $\hat S_m^z = S - a_m^\dagger a_m$, $\hat S_m^- \simeq (2S)^{1/2} a_m^\dagger$, and $\hat S_m^+ \simeq (2S)^{1/2} a_m$. In this



23 T. Holstein and H. Primakoff, Field dependence of the intrinsic domain magnetisation of a ferromagnet, Phys. Rev. 58 (1940), 1098-113.









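approximation, the one-dimensional Heisenberg Hamiltonian takes the form

$$\begin{aligned}
\hat H &= -J\sum_m\left\{\hat S_m^z \hat S_{m+1}^z + \frac{1}{2}\left(\hat S_m^+ \hat S_{m+1}^- + \hat S_m^- \hat S_{m+1}^+\right)\right\}\\
&= -JNS^2 - JS\sum_m\left\{-2a_m^\dagger a_m + \left(a_m^\dagger a_{m+1} + \text{h.c.}\right)\right\} + O(S^0)\\
&= -JNS^2 + JS\sum_m\left(a_{m+1}^\dagger - a_m^\dagger\right)\left(a_{m+1} - a_m\right) + O(S^0).
\end{aligned}$$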



Keeping fluctuations at leading order in $S$, the quadratic Hamiltonian can be diagonalized by Fourier transformation. In this case, it is convenient to impose periodic boundary conditions: $\hat{\mathbf S}_{m+N} = \hat{\mathbf S}_m$ and $a_{m+N} = a_m$, where $N$ denotes the total number of lattice sites. Defining

$$a_k = \frac{1}{\sqrt N}\sum_{m=1}^{N} e^{ikm} a_m, \qquad a_m = \frac{1}{\sqrt N}\sum_k^{\rm B.Z.} e^{-ikm} a_k, \qquad [a_k, a_{k'}^\dagger] = \delta_{kk'},$$

where the summation over $k$ runs over the Brillouin zone, the Hamiltonian for the one-dimensional lattice system takes the form (exercise)

$$\hat H = -JNS^2 + \sum_k^{\rm B.Z.} \hbar\omega_k\, a_k^\dagger a_k + O(S^0). \qquad (2.48)$$

Here $\hbar\omega_k = 2JS(1 - \cos k) = 4JS\sin^2(k/2)$ represents the dispersion relation of the spin excitations. In particular, in the limit $k \to 0$, the energy of the elementary excitations vanishes, $\hbar\omega_k \to JSk^2$. These massless low-energy excitations, known as magnons, describe the elementary spin-wave excitations of the ferromagnet. Taking into account terms at higher order in the parameter $1/S$, one finds interactions between the magnons. A comparison of these theoretical predictions and experiment is shown in Fig. 2.11.
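The dispersion relation can be confirmed without performing the Fourier transformation by hand. As a minimal sketch, the quadratic form $JS\sum_m (a_{m+1}^\dagger - a_m^\dagger)(a_{m+1} - a_m)$ is written as $\sum_{mn} a_m^\dagger M_{mn} a_n$ and the $N \times N$ matrix $M$ is diagonalized numerically. The function name `magnon_energies` and the values $N = 16$, $J = S = 1$ are assumptions made for the example.

```python
import numpy as np

def magnon_energies(N, J=1.0, S=1.0):
    """Eigenvalues of the hopping matrix of the quadratic spin-wave Hamiltonian."""
    shift = np.roll(np.eye(N), 1, axis=0)   # cyclic shift: periodic boundary conditions
    M = J * S * (2 * np.eye(N) - shift - shift.T)
    return np.sort(np.linalg.eigvalsh(M))

N, J, S = 16, 1.0, 1.0
k = 2 * np.pi * np.arange(N) / N            # allowed momenta in the Brillouin zone
numerical = magnon_energies(N, J, S)
analytic = np.sort(4 * J * S * np.sin(k / 2) ** 2)

print("max deviation from 4JS sin^2(k/2):", np.max(np.abs(numerical - analytic)))
```

For the smallest non-zero momentum, $k = 2\pi/N$, the energy is already close to the quadratic form $JSk^2$.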



Quantum antiferromagnet 

Having explored the elementary excitation spectrum of the ferromagnet, we now turn to the discussion of the spin $S$ Heisenberg antiferromagnetic Hamiltonian

$$\hat H = J\sum_{\langle mn\rangle}\hat{\mathbf S}_m\cdot\hat{\mathbf S}_n,$$

where, once again, $J > 0$. As we have seen above, such antiferromagnetic systems occur










Figure 2.12 (a) Example of a two-dimensional bipartite lattice, (b) Example of a non-bipartite 
lattice. Notice that, with the latter, no antiferromagnetic arrangement of the spins can be made 
that recovers the maximum exchange energy from each and every bond. 



in the arena of strongly correlated electron compounds. Although the Hamiltonian differs from its ferromagnetic relative "only" by a change of sign, the differences in the physics are drastic. Firstly, the phenomenology displayed by the antiferromagnetic Hamiltonian $\hat H$ depends sensitively on the morphology of the underlying lattice. For a bipartite lattice, i.e. one in which the neighbors of one sublattice A belong to the other sublattice B (see Fig. 2.12(a)), the ground states of the Heisenberg antiferromagnet are close (footnote 24) to a staggered spin configuration, known as a Néel state, where all neighboring spins are antiparallel (see Fig. 2.12). Again the ground state is degenerate, i.e. a global rotation of all spins by the same amount does not change the energy. By contrast, on non-bipartite lattices such as the triangular lattice shown in Fig. 2.12(b), no spin arrangement can be found wherein each bond recovers the full exchange energy $J$. Spin models of this kind are said to be frustrated.

EXERCISE Engaging only symmetry considerations, try to identify a possible classical ground state of the triangular lattice Heisenberg antiferromagnet. (Hint: Construct the classical ground state of a three-site plaquette and then develop the periodic continuation.) Show that the classical antiferromagnetic ground state of the Kagome lattice (a periodic array of corner-sharing "stars of David") has a continuous spin degeneracy generated by local spin rotations. How might the degeneracy affect the transition to an ordered phase?

Returning to the one-dimensional system, we first note that a chain is trivially bipartite. 
As before, our strategy will be to expand the Hamiltonian in terms of bosonic operators. 
However, before doing so, it is convenient to apply a canonical transformation to the Hamil- 
tonian in which the spins on one sublattice, say B, are rotated through 180° about the 



Louis Néel 1904-2000

Nobel Laureate in Physics in 1970, shared with Hannes Olof Gösta Alfvén, for his "fundamental work and discoveries concerning antiferromagnetism and ferrimagnetism that have led to important applications in solid state physics." (Image © The Nobel Foundation.)



24 It is straightforward to verify that the classical ground state, the Néel state, is now not an exact eigenstate of the quantum Hamiltonian. The true ground state exhibits zero-point fluctuations reminiscent of the quantum harmonic oscillator or atomic chain. However, when $S \gg 1$, it serves as a useful reference state from which fluctuations can be examined.










Figure 2.13 (a) Néel state configuration of the spin chain. (b) Cartoon of an antiferromagnetic spin wave.



$x$-axis, i.e. $\hat S_B^x \to \tilde S_B^x = \hat S_B^x$, $\hat S_B^y \to \tilde S_B^y = -\hat S_B^y$, and $\hat S_B^z \to \tilde S_B^z = -\hat S_B^z$. That is, when represented in terms of the new operators, the Néel ground state looks like a ferromagnetic state, with all spins aligned. We expect that a gradual distortion of this state will produce the antiferromagnetic analog of the spin waves discussed in the previous section (see Fig. 2.11).

Represented in terms of the transformed operators, the Hamiltonian takes the form

$$\hat H = -J\sum_m\left[\hat S_m^z \tilde S_{m+1}^z - \frac{1}{2}\left(\hat S_m^+ \tilde S_{m+1}^+ + \hat S_m^- \tilde S_{m+1}^-\right)\right].$$



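Once again, applying an expansion of the Holstein-Primakoff representation, $\hat S_m^- \simeq (2S)^{1/2} a_m^\dagger$, etc., one obtains the Hamiltonian

$$\hat H = -NJS^2 + JS\sum_m\left[a_m^\dagger a_m + a_{m+1}^\dagger a_{m+1} + a_m a_{m+1} + a_m^\dagger a_{m+1}^\dagger\right] + O(S^0).$$

At first sight the structure of this Hamiltonian, albeit quadratic in the Bose operators, looks awkward. However, after Fourier transformation, $a_m = N^{-1/2}\sum_k e^{-ikm} a_k$, it assumes the more accessible form

$$\hat H = -NJS(S+1) + JS\sum_k \bigl(a_k^\dagger,\; a_{-k}\bigr)\begin{pmatrix} 1 & \gamma_k\\ \gamma_k & 1\end{pmatrix}\begin{pmatrix} a_k\\ a_{-k}^\dagger\end{pmatrix} + O(S^0),$$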

where $\gamma_k = \cos k$. Apart from the definition of the matrix kernel between the Bose operators, $\hat H$ is equivalent to the Hamiltonian (2.41) discussed in connection with the charge density wave. Performing the same steps as before, the non-particle-number-conserving contributions $a^\dagger a^\dagger$ can be removed by Bogoliubov transformation. As a result, the transformed Hamiltonian assumes the diagonal form

$$\hat H = -NJS(S+1) + 2JS\sum_k |\sin k|\left(\alpha_k^\dagger \alpha_k + \frac{1}{2}\right). \qquad (2.49)$$

Figure 2.14 Experimentally obtained spin-wave dispersion of the high-$T_c$ parent compound La$_2$CuO$_4$, a prominent spin 1/2 antiferromagnet. Reprinted with permission from R. Coldea, S. M. Hayden, G. Aeppli, et al., Spin waves and electronic excitations in La$_2$CuO$_4$, Phys. Rev. Lett. 86 (2001), 5377-80. Copyright (2001) by the American Physical Society.



Thus, in contrast to the ferromagnet, the spin-wave excitations of the antiferromagnet (Fig. 2.14) display a linear spectrum in the limit $k \to 0$. Surprisingly, although developed in the limit of large spin, experiment shows that even for $S = 1/2$ spin chains, the integrity of the linear dispersion is maintained (see Fig. 2.14).
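The spin-wave energies can be obtained numerically by the same recipe as for the charge density wave: the positive eigenvalue of $\sigma_3 K_k$, with $K_k$ the 2 × 2 kernel containing $\gamma_k = \cos k$, gives the excitation energy in units of $2JS$. The sketch below assumes $J = S = 1$ and avoids the points $k = 0, \pi$, where $\sigma_3 K_k$ is not diagonalizable. The function name `afm_spin_wave_energy` is hypothetical.

```python
import numpy as np

def afm_spin_wave_energy(k, J=1.0, S=1.0):
    """Spin-wave energy from the positive eigenvalue of sigma3 K_k."""
    gamma = np.cos(k)
    K = np.array([[1.0, gamma], [gamma, 1.0]])
    sigma3 = np.diag([1.0, -1.0])
    eps = np.max(np.linalg.eigvals(sigma3 @ K).real)   # equals |sin k|
    return 2 * J * S * eps

ks = np.linspace(0.1, 3.0, 7)
numerical = np.array([afm_spin_wave_energy(k) for k in ks])
analytic = 2 * np.abs(np.sin(ks))

print("max deviation from 2JS|sin k|:", np.max(np.abs(numerical - analytic)))
```

At small $k$ the energy grows linearly, $2JS|k|$, in contrast to the quadratic magnon dispersion of the ferromagnet.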

More generally, it turns out that, for chains of arbitrary half integer spin $S = 1/2, 3/2, 5/2, \ldots$, the low-energy spectrum is linear, in agreement with the results of the harmonic approximation. In contrast, for chains of integer spin $S = 1, 2, 3, \ldots$, the low-energy spectrum contains a gap, i.e. these systems do not support long-range excitations. As a rule, the sensitivity of a physical phenomenon to the characteristics of a sequence of numbers - such as half integer vs. integer - signals the presence of a mechanism of topological origin (footnote 25). At the same time, the formation of a gap (observed for integer chains) represents an interaction effect; at orders beyond the harmonic approximation, spin waves begin to interact nonlinearly with each other, a mechanism that may ($S$ integer) but need not ($S$ half integer) destroy the wave-like nature of low-energy excitations. In Section 9.3.3 - in a chapter devoted to a general discussion of the intriguing condensed matter phenomena generated by the conspiracy of global (topological) structures with local interaction mechanisms - we will discuss these phenomena on a deeper level.

25 Specifically, the topological signature of a spin field configuration will turn out to be the number of times the classical analog of a spin (a vector on the unit sphere) will wrap around the sphere in (1 + 1)-dimensional space time.



2.3 Summary and outlook 

This concludes our preliminary discussion of applications of second quantization. Additional examples can be found in the problems. In this chapter, we have introduced second quantization as a tool whereby problems of many-body quantum mechanics can be addressed more efficiently than by the traditional language of symmetrized many-body wave functions. We have discussed how the two approaches are related to each other and how the standard operations of quantum mechanics can be performed by second quantized methods.

One may note that, beyond qualitative discussions, the list of concrete applications 
encountered in this chapter involved problems that either were non-interacting from the 
outset, or could be reduced to a quadratic operator structure by a number of suitable 
manipulations. However, we carefully avoided dealing with interacting problems where no 
such reductions are possible - the majority by far of the problems encountered in condensed 
matter physics. What can be done in situations where interactions, i.e. operator contribu- 
tions of fourth or higher order, are present and no tricks like bosonization can be played? 
Generically, either interacting problems of many-body physics are fundamentally inaccessi- 
ble to perturbation theory, or they necessitate perturbative analyses of infinite order in the 
interaction contribution. Situations where a satisfactory result can be obtained by first- or 
second-order perturbation theory are exceptional. Within second quantization, large-order 
perturbative expansions in interaction operators lead to complex polynomials of creation 
and annihilation operators. Quantum expectation values taken over such structures can be 
computed by a reductive algorithm, known as Wick’s theorem. However, from a modern 
perspective, the formulation of perturbation theory in this way is not very efficient. More 
importantly, problems that are principally non-perturbative have emerged as the focus of 
interest. 

To understand the language of modern condensed matter physics, we thus need to develop 
another layer of theory, known as field integration. In essence, the latter is a concept 
generalizing the effective action approach of Chapter 1 to the quantum level. However, 
before discussing quantum field theory, we should understand how the concept works in 
principle, i.e. on the level of single particle quantum mechanics. This will be the subject of 
the next chapter. 



2.4 Problems 

Stone-von Neumann theorem

In the text we introduced creation and annihilation operators in a constructive manner, i.e. by specifying 
their action on a fixed Fock space state. We saw that this definition implied remarkably simple algebraic 
relations between the newly introduced operators - the Heisenberg algebra (2.7). In this problem we 
explore the mathematical structure behind this observation. (The problem has been included for the 







benefit of the more mathematically inclined. Readers primarily interested in the practical aspects of 
second quantization may safely skip it.) 

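Let us define an abstract algebra of objects $a_\lambda$ and $\tilde a_\lambda$ by

$$[a_\lambda, \tilde a_\mu]_\zeta = \delta_{\lambda\mu}, \qquad [a_\lambda, a_\mu]_\zeta = [\tilde a_\lambda, \tilde a_\mu]_\zeta = 0.$$

Further, let us assume that this algebra is unitarily represented in some vector space $\mathcal F$. This means that (i) to every $a_\lambda$ and $\tilde a_\lambda$ we assign a linear map $T_{a_\lambda}: \mathcal F \to \mathcal F$ such that (ii) $T_{[a_\lambda, \tilde a_\mu]_\zeta} = [T_{a_\lambda}, T_{\tilde a_\mu}]_\zeta$, and (iii) $T_{\tilde a_\lambda} = T_{a_\lambda}^\dagger$. To keep the notation simple, we will denote $T_{a_\lambda}$ by $a_\lambda$ (now regarded as a linear map $\mathcal F \to \mathcal F$) and $T_{\tilde a_\lambda}$ by $a_\lambda^\dagger$.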

The Stone-von Neumann theorem states that the representation above is unique, i.e. that, 
up to unitary transformations of basis, there is only one such representation. The statement 
is proven by explicit construction of a basis on which the operators act in a specific and well- 
defined way. We will see that this action is given by Eq. (2.6), i.e. the reference basis is but 
the Fock space basis used in the text. This proves that the Heisenberg algebra encapsulates 
the full mathematical structure of the formalism of second quantization. 

(a) We begin by noting that the operators $\hat n_\lambda = a^\dagger_\lambda a_\lambda$ are Hermitian and commute with each
other, i.e. they can be simultaneously diagonalized. Let $|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle$ be an orthonormalized
eigenbasis of the operators $\{\hat n_\lambda\}$, i.e. $\hat n_{\lambda_i}|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle = n_{\lambda_i}|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle$. Show
that, up to unit-modular factors, this basis is unique. (Hint: Use the irreducibility of
the transformation.)

(b) Show that $a_{\lambda_i}|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle$ is an eigenstate of $\hat n_{\lambda_i}$ with eigenvalue $n_{\lambda_i} - 1$. Use this
information to show that all eigenvalues $n_{\lambda_i}$ are non-negative integers. (Hint: note positivity
of the scalar norm.) Show that the explicit representation of the basis is given by

$$|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle = \prod_i \frac{(a^\dagger_{\lambda_i})^{n_{\lambda_i}}}{\sqrt{n_{\lambda_i}!}}\,|0\rangle, \qquad (2.50)$$

where $|0\rangle$ is the unique state which has eigenvalue 0 for all $\hat n_{\lambda_i}$. Comparison with Eq. (2.4)
shows that the basis constructed above indeed coincides with the Fock space basis
considered in the text.

Answer: 

(a) Suppose we had identified two bases $\{|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle\}$ and $\{|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle'\}$ on which
all operators $\hat n_{\lambda_i}$ assumed equal eigenvalues. The irreducibility of the representation
implies the existence of some polynomial $P(\{a_{\mu_i}, a^\dagger_{\mu_i}\})$ such that $|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle =
P(\{a_{\mu_i}, a^\dagger_{\mu_i}\})|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle'$. Now, the action of $P$ must not change any of the
eigenvalues of $\hat n_{\lambda_i}$, which means that $P$ contains the operators $a_\mu$ and $a^\dagger_\mu$ in equal numbers.
Reordering operators, we may thus bring $P$ into the form $P(\{a_{\mu_i}, a^\dagger_{\mu_i}\}) = P(\{\hat n_{\mu_i}\})$.
However, the action of this latter expression on $|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle'$ just produces a number,
i.e. the bases are equivalent.







(b) For a given state $|n\rangle$ (concentrating on a fixed element of the single-particle basis, we
suppress the subscript $\lambda_i$ throughout), let us choose an integer $q$ such that $\hat n\, a^{q-1}|n\rangle =
(n - q + 1)\, a^{q-1}|n\rangle$ with $n - q + 1 > 0$ while $n - q \le 0$. We then obtain

$$0 \ge (n - q)\langle n|(a^\dagger)^q a^q|n\rangle = \langle n|(a^\dagger)^q\, \hat n\, a^q|n\rangle = \langle n|(a^\dagger)^{q+1} a^{q+1}|n\rangle \ge 0.$$

The only way to satisfy this sequence of inequalities is to require that $\langle n|(a^\dagger)^{q+1} a^{q+1}|n\rangle = 0$
and $n - q = 0$. The last equation implies the “integer-valuedness” of $n$. (In principle,
we ought to prove that a zero-eigenvalue state $|0\rangle$ exists. To show this, take any reference
state $|n_{\lambda_1}, n_{\lambda_2}, \ldots\rangle$ and apply operators $a_{\lambda_i}$ as long as it takes to lower all eigenvalues
$n_{\lambda_i}$ to zero.) Using the commutation relations, it is then straightforward to verify that
the r.h.s. of Eq. (2.50) (a) is unit normalized and (b) has eigenvalue $n_{\lambda_i}$ for each $\hat n_{\lambda_i}$.
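
The normalization and eigenvalue claims for Eq. (2.50) can be spot-checked numerically for a single bosonic mode. The sketch below is only an illustration, not part of the proof: it assumes the standard action $a^\dagger|n\rangle = \sqrt{n+1}\,|n+1\rangle$ and $a|n\rangle = \sqrt{n}\,|n-1\rangle$ (the very representation the problem sets out to derive), and the helper names `apply_creation`, `apply_annihilation`, and `fock_state` are hypothetical.

```python
import math

def apply_creation(state):
    """Apply a^dagger to a single-mode state stored as {n: amplitude}."""
    return {n + 1: amp * math.sqrt(n + 1) for n, amp in state.items()}

def apply_annihilation(state):
    """Apply a; the n = 0 component is annihilated and therefore dropped."""
    return {n - 1: amp * math.sqrt(n) for n, amp in state.items() if n > 0}

def fock_state(n):
    """Build (a^dagger)^n / sqrt(n!) |0> by repeated application of a^dagger."""
    state = {0: 1.0}
    for _ in range(n):
        state = apply_creation(state)
    norm = math.sqrt(math.factorial(n))
    return {m: amp / norm for m, amp in state.items()}

def norm_squared(state):
    """<psi|psi> for a state stored as {n: amplitude}."""
    return sum(abs(amp) ** 2 for amp in state.values())

def number_expectation(state):
    """<psi| a^dagger a |psi>, evaluated as the squared norm of a|psi>."""
    return norm_squared(apply_annihilation(state))

if __name__ == "__main__":
    for n in range(5):
        psi = fock_state(n)
        print(n, norm_squared(psi), number_expectation(psi))
```

For each $n$ the state has unit norm and occupation number $n$ up to rounding, in line with the two properties stated for the r.h.s. of Eq. (2.50).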



Semiclassical spin waves 

In Chapter 1, the development of a theory of lattice vibrations in the harmonic atom chain was motivated 
by the quantization of the continuum classical theory. The latter provided insight into the nature of 
the elementary collective excitations. Here we will employ the semiclassical theory of spin dynamics to 
explore the nature of elementary spin-wave excitations. 

(a) Making use of the spin commutation relation, $[\hat S_i^\alpha, \hat S_j^\beta] = i\delta_{ij}\epsilon^{\alpha\beta\gamma}\hat S_i^\gamma$, apply the operator
identity $i\dot{\hat{\mathbf S}}_i = [\hat{\mathbf S}_i, \hat H]$ to express the equation of motion of a spin in a nearest neighbor
spin-$S$ one-dimensional Heisenberg ferromagnet as a difference equation (N.B. $\hbar = 1$).

(b) Interpreting the spins as classical vectors, and taking the continuum limit, show that
the equation of motion of the hydrodynamic modes takes the form $\dot{\mathbf S} = J\mathbf S \times \partial^2\mathbf S$ where
we have assumed a unit lattice spacing. (Hint: In taking the continuum limit, apply a
Taylor expansion to the spins, i.e. $\mathbf S_{i+1} = \mathbf S_i + \partial\mathbf S_i + \cdots$.) Find and sketch a wave-like
solution describing small angle precession around a globally magnetized state $\mathbf S_i = S\mathbf e_z$
(i.e. a solution as shown in Fig. 2.10).

Answer: 



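(a) Making use of the equation of motion, and the commutation relation, substitution of
the Heisenberg ferromagnetic Hamiltonian gives the difference equation

$$\dot{\mathbf S}_i = J\mathbf S_i \times (\mathbf S_{i+1} + \mathbf S_{i-1}).$$

(b) Interpreting the spins as classical vectors, and applying the Taylor expansion
$\mathbf S_{i\pm1} \mapsto \mathbf S(x \pm 1) = \mathbf S \pm \partial\mathbf S + \partial^2\mathbf S/2 + \cdots$, the first-derivative terms cancel between the two
neighbors and $\mathbf S \times \mathbf S = 0$, so that one obtains the classical equation of motion shown.
Making the ansatz $\mathbf S = (c\cos(kx - \omega t),\, c\sin(kx - \omega t),\, \sqrt{S^2 - c^2})$ one may confirm that
the equation of motion is satisfied if $\omega = Jk^2\sqrt{S^2 - c^2}$, i.e. $\omega \simeq JSk^2$ for small angle
precession ($c \ll S$).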
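
As a cross-check of the spin-wave solution, the sketch below inserts the precessing ansatz directly into the lattice equation $\dot{\mathbf S}_i = J\mathbf S_i \times (\mathbf S_{i+1} + \mathbf S_{i-1})$ rather than its continuum limit. On the lattice the ansatz works with the frequency $\omega = 2JS_z(1 - \cos k)$, where $S_z = \sqrt{S^2 - c^2}$, and this approaches $JS_zk^2$ at long wavelength (i.e. $Jk^2$ when the $z$-component is set to unity). The function names and the sample values of $x$, $t$, $k$, and $c$ are illustrative assumptions.

```python
import math

def cross(a, b):
    """Vector product of two 3-tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def lattice_omega(k, c, S=1.0, J=1.0):
    """Precession frequency 2 J S_z (1 - cos k) of the lattice spin wave."""
    return 2.0 * J * math.sqrt(S * S - c * c) * (1.0 - math.cos(k))

def spin(x, time, k, c, S=1.0, J=1.0):
    """Ansatz S = (c cos(kx - wt), c sin(kx - wt), sqrt(S^2 - c^2))."""
    phase = k * x - lattice_omega(k, c, S, J) * time
    return (c * math.cos(phase), c * math.sin(phase), math.sqrt(S * S - c * c))

def lattice_residual(k, c, S=1.0, J=1.0, x=3, time=0.7):
    """Largest component of dS_x/dt - J S_x x (S_{x+1} + S_{x-1})."""
    omega = lattice_omega(k, c, S, J)
    phase = k * x - omega * time
    # Analytic time derivative of the ansatz at site x.
    lhs = (c * omega * math.sin(phase), -c * omega * math.cos(phase), 0.0)
    left = spin(x - 1, time, k, c, S, J)
    right = spin(x + 1, time, k, c, S, J)
    neighbours = tuple(l + r for l, r in zip(left, right))
    rhs = tuple(J * comp for comp in cross(spin(x, time, k, c, S, J), neighbours))
    return max(abs(l - r) for l, r in zip(lhs, rhs))

if __name__ == "__main__":
    print(lattice_residual(k=0.3, c=0.1))
    # Ratio to the continuum result J S_z k^2; tends to 1 as k -> 0.
    k, c = 0.01, 0.1
    print(lattice_omega(k, c) / (math.sqrt(1 - c * c) * k * k))
```

The residual vanishes to rounding accuracy for any amplitude $c < S$, so on the lattice the ansatz is not restricted to small angles; only the identification with the quadratic dispersion relies on $k \ll 1$.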








Figure 2.15 (a) An sp$^2$-hybridized polymer chain. (b) One of the configurations of the Peierls
distorted chain. The double bonds represent the short links of the lattice. (c) A topological defect
separating two domains of the ordered phase.

Su-Schrieffer-Heeger model of a conducting polymer chain

Polyacetylene consists of bonded CH groups forming an isomeric long-chain polymer. According to
molecular orbital theory, the carbon atoms are expected to be sp$^2$-hybridized, suggesting a planar
configuration of the molecule. An unpaired electron is expected to occupy a single $\pi$-orbital which is oriented
perpendicular to the plane. The weak overlap of the $\pi$-orbitals delocalizes the electrons into a narrow
conduction band. According to the nearly free electron theory, one might expect the half-filled
conduction band of a polyacetylene chain to be metallic. However, the energy of a half-filled band of a
one-dimensional system can always be lowered by imposing a periodic lattice distortion known as a
Peierls instability (see Fig. 2.15). The aim of this problem is to explore the instability.

(a) At its simplest level, the conduction band of polyacetylene can be modeled by a simple
(arguably over-simplified) microscopic Hamiltonian, due to Su, Schrieffer, and Heeger,26
in which the hopping matrix elements of the electrons are modulated by the lattice
distortion of the atoms. By taking the displacement of the atomic sites to be $u_n$, and
treating their dynamics as classical, the effective Hamiltonian can be cast in the form

$$H = -t\sum_{n=1}^{N}(1 + u_n)\left[c^\dagger_{n\sigma}c_{n+1\sigma} + \text{h.c.}\right] + \sum_{n=1}^{N}\frac{k_s}{2}(u_{n+1} - u_n)^2,$$

where, for simplicity, the boundary conditions are taken to be periodic, and summation
over the spins $\sigma$ is assumed. The first term describes the hopping of electrons between
neighboring sites with a matrix element modulated by the periodic distortion of the
bond-length, while the last term represents the associated increase in the elastic energy.
Taking the lattice distortion to be periodic, $u_n = (-1)^n\alpha$, and the number of sites to
be even, bring the Hamiltonian to diagonal form. (Hint: Note that the lattice distortion
lowers the symmetry of the lattice. The Hamiltonian is most easily diagonalized by
distinguishing the two sites of the sublattice - i.e. doubling the size of the elementary
unit cell.) Show that the Peierls distortion of the lattice opens a gap in the spectrum at
the Fermi level of the half-filled system.

(b) By estimating the total electronic and elastic energy of the half-filled band (i.e. an
average of one electron per lattice site), show that the one-dimensional system is always
unstable towards the Peierls distortion. To complete this calculation, you will need
the approximate formula for the (elliptic) integral,

$$\int_{-\pi/2}^{\pi/2} dk\,\left(1 - (1 - \alpha^2)\sin^2 k\right)^{1/2} \simeq 2 + (a_1 - b_1\ln\alpha^2)\alpha^2 + O(\alpha^2\ln\alpha^2),$$

where $a_1$ and $b_1$ are (unspecified) numerical constants.

(c) For an even number of sites, the Peierls instability has two degenerate configurations
(see Fig. 2.15(a)) - ABABAB. . . and BABABA. . . Comment on the qualitative form of
the ground state lattice configuration if the number of sites is odd (see Fig. 2.15(b)).
Explain why such configurations give rise to mid-gap states.

26 W. P. Su, J. R. Schrieffer, and A. J. Heeger, Solitons in polyacetylene, Phys. Rev. Lett. 42 (1979), 1698-701.

Answer: 



(a) Since each unit cell is of twice the dimension of the original lattice, we begin by recasting
the Hamiltonian in a sublattice form

$$H = -t\sum_{m=1,\sigma}^{N/2}\left\{(1 + \alpha)\left[a^\dagger_{m\sigma}b_{m\sigma} + \text{h.c.}\right] + (1 - \alpha)\left[b^\dagger_{m\sigma}a_{m+1\sigma} + \text{h.c.}\right]\right\} + 2Nk_s\alpha^2,$$

where the creation operators $a^\dagger_m$ and $b^\dagger_m$ act on the two sites of the elemental unit cell of
the distorted lattice. Switching to the Fourier basis, $a_m = \sqrt{2/N}\sum_k e^{2ikm}a_k$ (similarly
$b_m$), where $k$ takes $N/2$ values uniformly on the interval $[-\pi/2, \pi/2]$ and the lattice
spacing of the undistorted system is taken to be unity, the Hamiltonian takes the form

$$H = 2Nk_s\alpha^2 - t\sum_{k\sigma}\begin{pmatrix} a^\dagger_{k\sigma} & b^\dagger_{k\sigma}\end{pmatrix}\begin{pmatrix} 0 & (1+\alpha) + (1-\alpha)e^{-2ik} \\ (1+\alpha) + (1-\alpha)e^{2ik} & 0 \end{pmatrix}\begin{pmatrix} a_{k\sigma} \\ b_{k\sigma}\end{pmatrix}.$$

Diagonalizing the $2\times2$ matrix, one obtains $\epsilon(k) = \pm 2t\left[1 + (\alpha^2 - 1)\sin^2 k\right]^{1/2}$. Reassuringly,
in the limit $\alpha \to 0$, one recovers the cosine spectrum characteristic of the undistorted
tight-binding problem while, in the limit $\alpha \to 1$, pairs of monomers become
decoupled and we obtain a massively degenerate bonding and antibonding spectrum.

(b) According to the formula given in the text, the total shift in energy is given by $\delta\epsilon =
-4t(a_1 - b_1\ln\alpha^2)\alpha^2 + 2k_s\alpha^2$. Maximizing the energy gain with respect to $\alpha$, one finds
that the stable configuration is found when $\alpha^2 = \exp\left[\frac{a_1}{b_1} - 1 - \frac{k_s}{2tb_1}\right]$.

(c) If the number of sites is odd, the Peierls distortion is inevitably frustrated. The result is 
that the polymer chain must accommodate a topological excitation. The excitation 
is said to be topological because the defect cannot be removed by a smooth continu- 
ous deformation. Its effect on the spectrum of the model is to introduce a state that 
lies within the band gap of the material. The consideration of an odd number of sites 
forces a topological defect into the system. However, even if the number of sites is even, 
one can create low energy topological excitations of the system either by doping (see 
Fig. 2.15(b)), or by the creation of excitons, particle-hole excitations of the system. 
Indeed, such topological excitations can dominate the transport properties of the system. 
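
The statements in parts (a) and (b) lend themselves to a quick numerical check. The sketch below compares the eigenvalue $t|h(k)|$ of the $2\times2$ Bloch matrix, with $h(k) = (1+\alpha) + (1-\alpha)e^{2ik}$, against the closed form $2t[1 + (\alpha^2 - 1)\sin^2 k]^{1/2}$, and then estimates the energy change per site by integrating the filled lower band with a midpoint rule. The prefactor $2t/\pi$ in `energy_shift_per_site` is an assumption of this sketch: it follows from taking $N/2$ momenta on an interval of length $\pi$, two spin projections, and a completely filled lower band. The parameter values $t = k_s = 1$ are illustrative only.

```python
import cmath
import math

def bloch_offdiagonal(k, alpha):
    """Lower-left entry of the 2x2 Bloch matrix, in units of -t."""
    return (1 + alpha) + (1 - alpha) * cmath.exp(2j * k)

def upper_band(k, alpha, t=1.0):
    """Positive eigenvalue of -t [[0, conj(h)], [h, 0]], which is t |h|."""
    return t * abs(bloch_offdiagonal(k, alpha))

def closed_form(k, alpha, t=1.0):
    """Dispersion quoted in the answer: 2t [1 + (alpha^2 - 1) sin^2 k]^(1/2)."""
    return 2 * t * math.sqrt(1 + (alpha ** 2 - 1) * math.sin(k) ** 2)

def gap(alpha, t=1.0):
    """Splitting between the two bands at the zone boundary k = pi/2."""
    return 2 * upper_band(math.pi / 2, alpha, t)

def band_integral(alpha, steps=20000):
    """Midpoint estimate of int_{-pi/2}^{pi/2} dk [1 - (1 - alpha^2) sin^2 k]^(1/2)."""
    width = math.pi / steps
    total = 0.0
    for i in range(steps):
        k = -math.pi / 2 + (i + 0.5) * width
        total += math.sqrt(1.0 - (1.0 - alpha ** 2) * math.sin(k) ** 2)
    return total * width

def energy_shift_per_site(alpha, t=1.0, k_s=1.0):
    """Electronic gain plus elastic cost per site, relative to alpha = 0."""
    # The undistorted integral equals exactly 2 (integral of |cos k|).
    electronic = -(2.0 * t / math.pi) * (band_integral(alpha) - 2.0)
    elastic = 2.0 * k_s * alpha ** 2
    return electronic + elastic

if __name__ == "__main__":
    print(upper_band(0.7, 0.2), closed_form(0.7, 0.2))
    print(gap(0.1))
    print(energy_shift_per_site(0.05), energy_shift_per_site(0.5))
```

With these sample parameters the gap at $k = \pi/2$ is $4t\alpha$, and the energy shift is negative for a small distortion but positive for a large one, consistent with a minimum at a finite $\alpha$ produced by the $\alpha^2\ln\alpha^2$ term.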







Schwinger boson representation 

As with the Holstein-Primakoff representation, the Schwinger boson provides another representation of 
quantum mechanical spin. The aim of this problem is to confirm the validity of the representation. For 
practical purposes, the value of the particular representation depends on its application. 

In the Schwinger boson representation, the quantum mechanical spin is expressed in terms of two
bosonic operators $a$ and $b$ in the form

$$\hat S^+ = a^\dagger b, \qquad \hat S^- = (\hat S^+)^\dagger, \qquad \hat S^z = \frac{1}{2}\left(a^\dagger a - b^\dagger b\right).$$

Julian Schwinger 1918-1994
Nobel Laureate in Physics with Sin-Itiro Tomonaga and Richard P. Feynman, for their fundamental
work in quantum electrodynamics, with far-reaching consequences for the physics of elementary
particles. (Image © The Nobel Foundation.)




(a) Show that this definition is consistent with the commutation relations for spin:
$[\hat S^+, \hat S^-] = 2\hat S^z$.

(b) Using the bosonic commutation relations, show that

$$|S, m\rangle = \frac{(a^\dagger)^{S+m}}{\sqrt{(S+m)!}}\,\frac{(b^\dagger)^{S-m}}{\sqrt{(S-m)!}}\,|\Omega\rangle$$

is compatible with the definition of an eigenstate of the total spin operator $\hat{\mathbf S}^2$ and $\hat S^z$.
Here $|\Omega\rangle$ denotes the vacuum of the Schwinger bosons, and the total spin $S$ defines the
physical subspace $\{|n_a, n_b\rangle \,|\, n_a + n_b = 2S\}$.



Answer: 



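(a) Using the commutation relation for bosons, one finds $[\hat S^+, \hat S^-] = a^\dagger b\, b^\dagger a - b^\dagger a\, a^\dagger b =
a^\dagger a - b^\dagger b = 2\hat S^z$, as required.

(b) Using the identity $\hat{\mathbf S}^2 = (\hat S^z)^2 + \frac{1}{2}(\hat S^+\hat S^- + \hat S^-\hat S^+) = \frac{1}{4}(\hat n_a - \hat n_b)^2 + \hat n_a\hat n_b + \frac{1}{2}(\hat n_a + \hat n_b)$ one
finds that $\hat{\mathbf S}^2|S, m\rangle = \left[m^2 + (S + m)(S - m) + S\right]|S, m\rangle = S(S + 1)|S, m\rangle$, as required.
Similarly, one finds $\hat S^z|S, m\rangle = \frac{1}{2}(\hat n_a - \hat n_b)|n_a = S + m, n_b = S - m\rangle = m|S, m\rangle$, showing
$|S, m\rangle$ to be an eigenstate of the operator $\hat S^z$ with eigenvalue $m$.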
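
The number-operator form of $\hat{\mathbf S}^2$ used in part (b) can be checked with exact rational arithmetic. The following sketch evaluates $\frac{1}{4}(n_a - n_b)^2 + n_a n_b + \frac{1}{2}(n_a + n_b)$ on every state of the physical subspace $n_a + n_b = 2S$ and compares it with $S(S+1)$; the function names are illustrative.

```python
from fractions import Fraction

def casimir_from_occupations(n_a, n_b):
    """S^2 = (n_a - n_b)^2 / 4 + n_a n_b + (n_a + n_b) / 2 on |n_a, n_b>."""
    return Fraction((n_a - n_b) ** 2, 4) + n_a * n_b + Fraction(n_a + n_b, 2)

def sz_from_occupations(n_a, n_b):
    """S^z = (n_a - n_b) / 2 on |n_a, n_b>."""
    return Fraction(n_a - n_b, 2)

def multiplet_is_consistent(two_S):
    """True if all states with n_a + n_b = 2S carry S^2 = S(S+1) and |m| <= S."""
    S = Fraction(two_S, 2)
    for n_a in range(two_S + 1):
        n_b = two_S - n_a
        if casimir_from_occupations(n_a, n_b) != S * (S + 1):
            return False
        if abs(sz_from_occupations(n_a, n_b)) > S:
            return False
    return True

if __name__ == "__main__":
    for two_S in range(6):
        print(Fraction(two_S, 2), multiplet_is_consistent(two_S))
```

Each subspace with fixed $n_a + n_b = 2S$ contains $2S + 1$ states with $m = -S, \ldots, S$, as expected for a spin-$S$ multiplet.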



Jordan-Wigner transformation

So far we have shown how the algebra of quantum mechanical spin can be expressed using boson
operators - cf. the Holstein-Primakoff transformation and the Schwinger boson representation. In this
problem we show that a representation for spin-(1/2) can be obtained in terms of fermion operators.

Let us represent an up spin as a particle and a down spin as the vacuum, i.e. $|\!\uparrow\rangle \equiv |1\rangle =
f^\dagger|0\rangle$, $|\!\downarrow\rangle \equiv |0\rangle = f|1\rangle$. In this representation the spin raising and lowering operators are
expressed in the forms $\hat S^+ = f^\dagger$ and $\hat S^- = f$, while $\hat S^z = f^\dagger f - 1/2$.







(a) With this definition, confirm that the spins obey the algebra $[\hat S^+, \hat S^-] = 2\hat S^z$.

However, there is a problem: spins on different sites commute while fermion operators
anticommute, e.g. $\hat S^+_m\hat S^+_n = \hat S^+_n\hat S^+_m$ but $f^\dagger_m f^\dagger_n = -f^\dagger_n f^\dagger_m$. To obtain a faithful spin
representation, it is necessary to cancel this unwanted sign. Although a general procedure is
hard to formulate, in one dimension this can be achieved by a nonlinear transformation,
namely

$$\hat S^+_l = f^\dagger_l\, e^{i\pi\sum_{j<l}\hat n_j}, \qquad \hat S^-_l = e^{-i\pi\sum_{j<l}\hat n_j} f_l, \qquad \hat S^z_l = f^\dagger_l f_l - \frac{1}{2}.$$

Operationally, this seemingly complicated transformation is straightforward: in one
dimension, the particles can be ordered on the line. By counting the number of
particles “to the left” we can assign an overall sign of $+1$ or $-1$ to a given configuration
and thereby transmute the particles into fermions. (Put differently, the exchange of
two fermions induces a sign change that is compensated by the factor arising from the
phase - the “Jordan-Wigner string.”)

(b) Using the Jordan-Wigner representation, show that $\hat S^+_m\hat S^-_{m+1} = f^\dagger_m f_{m+1}$.

(c) For the spin-(1/2) anisotropic quantum Heisenberg spin chain, the spin Hamiltonian
assumes the form $H = -\sum_n\left[J_z\hat S^z_n\hat S^z_{n+1} + \frac{J_\perp}{2}\left(\hat S^+_n\hat S^-_{n+1} + \hat S^-_n\hat S^+_{n+1}\right)\right]$. Turning to the
Jordan-Wigner representation, show that the Hamiltonian can be cast in the form

$$H = -\sum_n\left[\frac{J_\perp}{2}\left(f^\dagger_n f_{n+1} + \text{h.c.}\right) + J_z\left(\frac{1}{4} - f^\dagger_n f_n + f^\dagger_n f_n f^\dagger_{n+1} f_{n+1}\right)\right].$$

(d) The mapping above shows that the one-dimensional quantum spin-(1/2) XY-model (i.e.
$J_z = 0$) can be diagonalized as a non-interacting theory of spinless fermions. In this
case, show that the spectrum assumes the form $\epsilon(k) = -J_\perp\cos ka$.

Answer: 

(a) Using the fermionic anti-commutation relations, one finds $[\hat S^+, \hat S^-] = [f^\dagger, f] = f^\dagger f -
f f^\dagger = 2f^\dagger f - 1 = 2\hat S^z$.

(b) Using the fact that the number operators on different sites commute, one finds
$\hat S^+_m\hat S^-_{m+1} = f^\dagger_m\, e^{i\pi\sum_{j<m}\hat n_j}\, e^{-i\pi\sum_{l<m+1}\hat n_l} f_{m+1} = f^\dagger_m\, e^{-i\pi\hat n_m} f_{m+1} = f^\dagger_m f_{m+1}$, where we
have made use of the fact that, for fermionic particles, $f^\dagger_m e^{-i\pi\hat n_m} = f^\dagger_m$.

(c) The fermion representation is simply obtained by substitution.

(d) With $J_z = 0$, the spin Hamiltonian assumes the form of a non-interacting tight-binding
Hamiltonian $H = -\frac{J_\perp}{2}\sum_n\left(f^\dagger_n f_{n+1} + \text{h.c.}\right)$. This Hamiltonian, which has been encountered
previously, is diagonalized in the Fourier space, after which one obtains the cosine band
dispersion.
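
The sign bookkeeping behind the Jordan-Wigner string can be checked on a short chain. The sketch below represents operators by their action on occupation-number basis states, with fermion operators defined by the usual ordering sign $(-1)^{\sum_{j<l} n_j}$ (an assumed convention), and builds $\hat S^\pm_l$ by attaching the string factor. It then confirms that $\hat S^+_m\hat S^-_{m+1} = f^\dagger_m f_{m+1}$ and that the resulting spin operators commute on different sites while the fermion operators do not. All function names are illustrative.

```python
from itertools import product

def string_sign(occ, site):
    """(-1) raised to the number of particles to the left of `site`."""
    return -1 if sum(occ[:site]) % 2 else 1

def f_dag(site):
    """Fermion creation operator acting on {occupation tuple: amplitude}."""
    def op(vec):
        out = {}
        for occ, amp in vec.items():
            if occ[site] == 0:
                new = occ[:site] + (1,) + occ[site + 1:]
                out[new] = out.get(new, 0) + string_sign(occ, site) * amp
        return out
    return op

def f(site):
    """Fermion annihilation operator with the same sign convention."""
    def op(vec):
        out = {}
        for occ, amp in vec.items():
            if occ[site] == 1:
                new = occ[:site] + (0,) + occ[site + 1:]
                out[new] = out.get(new, 0) + string_sign(occ, site) * amp
        return out
    return op

def jw_string(site):
    """exp(+-i pi sum_{j<site} n_j), which is diagonal with entries +-1."""
    def op(vec):
        return {occ: string_sign(occ, site) * amp for occ, amp in vec.items()}
    return op

def compose(*ops):
    """Operator product; the rightmost factor acts first."""
    def op(vec):
        for factor in reversed(ops):
            vec = factor(vec)
        return vec
    return op

def s_plus(site):
    return compose(f_dag(site), jw_string(site))

def s_minus(site):
    return compose(jw_string(site), f(site))

def matrix(op, n_sites):
    """Images of all basis states, used to compare two operators."""
    return {b: op({b: 1}) for b in product((0, 1), repeat=n_sites)}

def same_operator(op1, op2, n_sites):
    return matrix(op1, n_sites) == matrix(op2, n_sites)

if __name__ == "__main__":
    n = 3
    print(same_operator(compose(s_plus(0), s_minus(1)), compose(f_dag(0), f(1)), n))
    print(same_operator(compose(s_plus(0), s_plus(2)), compose(s_plus(2), s_plus(0)), n))
    print(same_operator(compose(f_dag(0), f_dag(2)), compose(f_dag(2), f_dag(0)), n))
```

The first two comparisons hold while the last fails, which is the statement that the string removes the unwanted fermionic sign between different sites.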







Spin-charge separation in one dimension

In Section 2.2 a free theory of interacting spinless fermions was developed in one dimension making 
use of the bosonization formalism. This analysis showed that the low-energy degrees of freedom were 
described by hydrodynamic charge (i.e. density) fluctuations that propagated with a linear dispersion. 
However, as well as charge, the electron degrees of freedom carry spin. The aim of this problem is to 
explore the fate of the spin degrees of freedom in a one-dimensional environment. 

As a first step, we introduce operators (cf. Eq. (2.35)) $\hat\rho_{sq\alpha} = \sum_k a^\dagger_{s(k+q)\alpha}a_{sk\alpha}$,
generalizing the previously introduced density operators for the presence of spin. Similarly,
the bosonic degrees of freedom of the theory (cf. Eq. (2.40)) now carry a spin index, $b_q \to b_{q\alpha}$.
One aspect that makes the problem more difficult to tackle than the previously explored
spinless case is that the 2/tF-momentum transfer interaction |/cp + q + <Zi,ti + q — qi,i 
) | — fcp + 9 + ? 2 i ti — ^’f + 9 ~ <Z 2 i 4-) in which a right-moving spin up electron is scattered 

to a left-moving spin up electron cannot be expressed in terms of slowly fluctuating density
operators. (If you don’t believe this, try!) However, using the renormalization group methods
introduced in Chapter 8, it can be shown that this type of interaction is physically largely
irrelevant and can be neglected from the outset.

Concentrating on the low-momentum-transfer interaction, the effective bosonic Hamiltonian
assumes the form (exercise)

$$H = \sum_{q>0,s,\alpha} v_F q\, b^\dagger_{sq\alpha}b_{sq\alpha} + \sum_{q>0,s,\alpha,\alpha'}\frac{q}{2\pi}\left[\frac{g_2}{2}\left(b^\dagger_{sq\alpha}b^\dagger_{\bar sq\alpha'} + \text{h.c.}\right) + g_4\, b^\dagger_{sq\alpha}b_{sq\alpha'}\right].$$

Introducing operators that create charge ($\rho$) and spin ($\sigma$) fluctuations, $b_{sq\rho} = \frac{1}{\sqrt2}(b_{sq\uparrow} +
b_{sq\downarrow})$, $b_{sq\sigma} = \frac{1}{\sqrt2}(b_{sq\uparrow} - b_{sq\downarrow})$, rearrange the Hamiltonian, and thereby show that it assumes
a diagonal form with the spin and charge degrees of freedom exhibiting different velocities.
This is a manifestation of spin-charge separation: even without the introduction of
spin-dependent forces, the spin and charge degrees of freedom of the electron in the metallic
conductor separate and propagate at different velocities. In this sense, there is no way
to adiabatically continue from non-interacting electrons to the collective charge and spin
excitations of the system.

Despite the “fragility” of the electron, and the apparent ubiquity of this phenomenon,
the observation of spin-charge separation in one-dimensional conductors has
presented a significant challenge to experimentalists. The reason is subtle. The completion
of an electrical circuit necessarily requires contact of the quantum wire with bulk leads. 
The leads involve a reservoir of electrons with conventional Fermi-liquid character. Electri- 
cal transport requires the recombination of the collective charge (holon) and spin (spinon) 
degrees of freedom at the contact to reconstitute physical electrons. It is an exasperating 
fact that this reconstitution of the physical electron “masks” the character of spin-charge 
separation. Instead, the phenomenon of spin-charge separation has been inferred indirectly 
through spectroscopic techniques. 

Answer: 

Motivated by the separation into spin and charge degrees of freedom, a rearrangement of
the Hamiltonian gives

$$H = \sum_{q>0,s}\left[v_F q\left(b^\dagger_{sq\rho}b_{sq\rho} + b^\dagger_{sq\sigma}b_{sq\sigma}\right) + \frac{g_2|q|}{2\pi}\left(b^\dagger_{sq\rho}b^\dagger_{\bar sq\rho} + \text{h.c.}\right) + \frac{g_4|q|}{\pi}\, b^\dagger_{sq\rho}b_{sq\rho}\right].$$

Once again, applying a Bogoliubov transformation, the Hamiltonian is brought to the
diagonal form

$$H = \sum_{q>0,s}\left[|q|\left((v_F + g_4/\pi)^2 - (g_2/\pi)^2\right)^{1/2}\alpha^\dagger_{sq\rho}\alpha_{sq\rho} + |q|v_F\,\alpha^\dagger_{sq\sigma}\alpha_{sq\sigma}\right] + \text{const.}$$
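
To make the difference between the two velocities concrete, the sketch below treats the charge sector at fixed $q > 0$ as a two-mode bosonic problem $A(b^\dagger_+b_+ + b^\dagger_-b_-) + B(b^\dagger_+b^\dagger_- + \text{h.c.})$ with $A = |q|(v_F + g_4/\pi)$ and $B = |q|g_2/\pi$. The excitation energy is obtained as the positive eigenvalue of the dynamical matrix $\sigma_z\begin{pmatrix}A & B\\ B & A\end{pmatrix}$, a standard device for bosonic quadratic forms, and compared with the quoted charge velocity. The identification of $A$ and $B$ and the numerical values of $v_F$, $g_2$, $g_4$ are assumptions made for illustration.

```python
import math

def charge_velocity(v_F, g2, g4):
    """Velocity of the charge mode: [(v_F + g4/pi)^2 - (g2/pi)^2]^(1/2)."""
    return math.sqrt((v_F + g4 / math.pi) ** 2 - (g2 / math.pi) ** 2)

def positive_eigenvalue(matrix):
    """Larger root of the characteristic polynomial of a real 2x2 matrix."""
    (a, b), (c, d) = matrix
    trace = a + d
    det = a * d - b * c
    return 0.5 * (trace + math.sqrt(trace * trace - 4.0 * det))

def bogoliubov_energy(q, v_F, g2, g4):
    """Excitation energy from sigma_z [[A, B], [B, A]] for the charge sector."""
    A = abs(q) * (v_F + g4 / math.pi)
    B = abs(q) * g2 / math.pi
    dynamical = ((A, B), (-B, -A))  # sigma_z times the coupling matrix
    return positive_eigenvalue(dynamical)

if __name__ == "__main__":
    v_F, g2, g4, q = 1.0, 0.5, 0.3, 2.0
    print(bogoliubov_energy(q, v_F, g2, g4), q * charge_velocity(v_F, g2, g4))
    print("spin velocity:", v_F, "charge velocity:", charge_velocity(v_F, g2, g4))
```

For any nonzero coupling the charge velocity departs from $v_F$, whereas the spin mode continues to propagate at $v_F$ within this model.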



The Kondo problem 

Historically, the Kondo problem has assumed a place of great significance in the development of the field 
of strongly correlated quantum systems. It represents perhaps the simplest example of a phenomenon 
driven by strong electron interaction and, unusually for this arena of physics, admits a detailed theoretical 
understanding. Further, in respect of the principles established in Chapter 1, it exemplifies a number of 
important ideas from the concept of reducibility - the collective properties of the system may be captured 
by a simplified effective Hamiltonian which includes only the relevant low-energy degrees of freedom - 
and the renormalization group. In the following problem, we will seek to develop the low-energy theory 
of the “Kondo impurity system” leaving the discussion of its phenomenology to Problems 5.5 and 8.8.5
in subsequent chapters. 

The Kondo effect is rooted in the experimental observation that, when small amounts of 
magnetic ion impurities are embedded in a metallic host (such as manganese in copper, or 
iron in CuAu alloys), a pronounced minimum develops in the temperature dependence of 
the resistivity. Although the phenomenon was discovered experimentally in 1934,27 it was
not until 1964 that a firm understanding of the phenomenon was developed by Kondo.28
Historically, the first step towards the resolution of this phenomenon came with a suggestion
by Anderson that the system could be modeled as an itinerant band of electron states
interacting with local dilute magnetic moments associated with the ion impurities.29 Anderson
proposed that the integrity of the local moment was protected by a large local Coulomb
repulsion which inhibited multiple occupancy of the orbital state - a relative of the Hubbard
$U$-interaction. Such behavior is encoded in the Anderson impurity Hamiltonian

$$H = \sum_{\mathbf k\sigma}\left[\epsilon_{\mathbf k}c^\dagger_{\mathbf k\sigma}c_{\mathbf k\sigma} + \left(V_{\mathbf k}d^\dagger_\sigma c_{\mathbf k\sigma} + \text{h.c.}\right)\right] + \sum_\sigma\epsilon_d n_{d\sigma} + Un_{d\uparrow}n_{d\downarrow},$$

where the operators $c^\dagger_{\mathbf k\sigma}$ create itinerant electrons of spin $\sigma$ and energy $\epsilon_{\mathbf k}$ in the metallic host
while the operators $d^\dagger_\sigma$ create electrons of spin $\sigma$ on the local impurity at position $\mathbf r = 0$.
Here we have used $n_{d\sigma} = d^\dagger_\sigma d_\sigma$ to denote the number operator. While electrons in the band
are assumed to be characterized by a Fermi-liquid-like behavior, those associated with the
impurity state experience an on-site Coulomb interaction of a strength characterized by a
Hubbard energy $U$.

27 de Haas, de Boer, and van den Berg, The electrical resistance of gold, copper, and lead at low temperatures,
Physica 1 (1934), 1115.

28 J. Kondo, Resistance minimum in dilute magnetic alloys, Prog. Theor. Phys. 32 (1964), 37-69.

29 P. W. Anderson, Localized magnetic states in metals, Phys. Rev. 124 (1961), 41-53.

According to the experimental phenomenology, the Fermi level $\epsilon_F$ is assumed to lie somewhere
in between the single-particle impurity level $\epsilon_d$ and $\epsilon_d + U$ so that, on average, the
site occupancy of the impurity is unity. Nevertheless, the matrix element coupling the local
moment to the itinerant electron states, $V_{\mathbf k} = L^{-d/2}\int d\mathbf r\, V(\mathbf r)e^{i\mathbf k\cdot\mathbf r}$, admits the existence of
virtual processes in which the site occupancy can fluctuate between zero and two. These
virtual fluctuations allow the spin on the impurity site to flip through exchange.

The discussion of the half-filled Hubbard system in Section 2.2 suggests that it will
be helpful to transform the Anderson impurity Hamiltonian to an effective theory which
exposes the low-energy content of the system. To this end, let us express the total
wavefunction of the many-body Hamiltonian $|\psi\rangle$ as the sum of terms $|\psi_0\rangle$, $|\psi_1\rangle$, and $|\psi_2\rangle$, where
the subscript denotes the occupancy of the impurity site. With this decomposition, the
Schrödinger equation for the Hamiltonian can be cast in matrix form, $\sum_{n=0}^{2}H_{mn}|\psi_n\rangle =
E|\psi_m\rangle$, where $H_{mn} = P_mHP_n$, and the operators $P_m$ project onto the subspace with $m$
electrons on the impurity (i.e. $P_0 = \prod_\sigma(1 - n_{d\sigma})$, etc.).



(a) Construct the operators $H_{mn}$ explicitly and explain why $H_{20} = H_{02} = 0$.

(b) Since we are interested in the effect of virtual excitations from the $|\psi_1\rangle$ subspace, we may
proceed by formally eliminating $|\psi_0\rangle$ and $|\psi_2\rangle$ from the Schrödinger equation. Doing so,
show that the equation for $|\psi_1\rangle$ can be written as

$$\left[H_{10}\frac{1}{E - H_{00}}H_{01} + H_{11} + H_{12}\frac{1}{E - H_{22}}H_{21}\right]|\psi_1\rangle = E|\psi_1\rangle.$$



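(c) At this stage, the equation for $|\psi_1\rangle$ is exact. Show that, when substituted into this
expression, an expansion to leading order in $1/U$ and $1/\epsilon_d$ leads to the expression

$$H_{12}\frac{1}{E - H_{22}}H_{21} + H_{10}\frac{1}{E - H_{00}}H_{01} \simeq -\sum_{\mathbf k\mathbf k'\sigma\sigma'}V^*_{\mathbf k}V_{\mathbf k'}\left(\frac{c^\dagger_{\mathbf k\sigma}c_{\mathbf k'\sigma'}d_\sigma d^\dagger_{\sigma'}}{U + \epsilon_d - \epsilon_{\mathbf k'}} + \frac{c_{\mathbf k'\sigma'}c^\dagger_{\mathbf k\sigma}d^\dagger_{\sigma'}d_\sigma}{\epsilon_{\mathbf k} - \epsilon_d}\right).$$

To obtain the first term in the expression, consider the commutation of $(E - H_{22})^{-1}$
with $H_{21}$ and make use of the fact that the total operator acts upon the singly occupied
subspace. A similar line of reasoning will lead to the second term in the expression. Here
$U + \epsilon_d - \epsilon_{\mathbf k'}$ and $\epsilon_{\mathbf k} - \epsilon_d$ denote the respective excitation energies of the virtual states.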

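Making use of the Pauli matrix identity, $\boldsymbol\sigma_{\alpha\beta}\cdot\boldsymbol\sigma_{\gamma\delta} = 2\delta_{\alpha\delta}\delta_{\beta\gamma} - \delta_{\alpha\beta}\delta_{\gamma\delta}$, it follows that
(exercise)

$$\sum_{\sigma\sigma'}c^\dagger_{\mathbf k\sigma}c_{\mathbf k'\sigma'}d^\dagger_{\sigma'}d_\sigma = 2\,\mathbf s_{\mathbf k\mathbf k'}\cdot\mathbf S_d + \frac{1}{2}\sum_{\sigma\sigma'}c^\dagger_{\mathbf k\sigma}c_{\mathbf k'\sigma}d^\dagger_{\sigma'}d_{\sigma'},$$

where $\mathbf S_d = \sum_{\alpha\beta}d^\dagger_\alpha\boldsymbol\sigma_{\alpha\beta}d_\beta/2$ denotes the spin 1/2 degree of freedom associated
with the impurity and $\mathbf s_{\mathbf k\mathbf k'} = \sum_{\alpha\beta}c^\dagger_{\mathbf k\alpha}\boldsymbol\sigma_{\alpha\beta}c_{\mathbf k'\beta}/2$. Combining this result with that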







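obtained above, up to an irrelevant constant, the total effective Hamiltonian (including
$H_{11}$) acting in the projected subspace $|\psi_1\rangle$ is given by

$$H_{sd} = \sum_{\mathbf k\sigma}\epsilon_{\mathbf k}c^\dagger_{\mathbf k\sigma}c_{\mathbf k\sigma} + \sum_{\mathbf k\mathbf k'}\left[2J_{\mathbf k,\mathbf k'}\,\mathbf s_{\mathbf k\mathbf k'}\cdot\mathbf S_d + K_{\mathbf k,\mathbf k'}\sum_\sigma c^\dagger_{\mathbf k\sigma}c_{\mathbf k'\sigma}\right],$$

where

$$J_{\mathbf k,\mathbf k'} = V^*_{\mathbf k}V_{\mathbf k'}\left[\frac{1}{U + \epsilon_d - \epsilon_{\mathbf k'}} + \frac{1}{\epsilon_{\mathbf k} - \epsilon_d}\right], \qquad K_{\mathbf k,\mathbf k'} = \frac{V^*_{\mathbf k}V_{\mathbf k'}}{2}\left[\frac{1}{\epsilon_{\mathbf k} - \epsilon_d} - \frac{1}{U + \epsilon_d - \epsilon_{\mathbf k'}}\right].$$

With both $U + \epsilon_d$ and $\epsilon_d$ greatly in excess of the typical excitation energy scales, one
may safely neglect the particular energy dependence of the parameters $J_{\mathbf k,\mathbf k'}$ and $K_{\mathbf k,\mathbf k'}$.
In this case, the exchange interaction $J_{\mathbf k,\mathbf k'}$ can be treated as local, the scattering term
$K_{\mathbf k,\mathbf k'}$ can be absorbed into a shift of the single-particle energy of the itinerant band, and
the positive (i.e. antiferromagnetic) exchange coupling can be accommodated through
the effective sd-Hamiltonian

$$H_{sd} = \sum_{\mathbf k\sigma}\epsilon_{\mathbf k}c^\dagger_{\mathbf k\sigma}c_{\mathbf k\sigma} + 2J\,\mathbf S_d\cdot\mathbf s(\mathbf r = 0), \qquad (2.51)$$

where $\mathbf s(\mathbf r = 0) = \sum_{\mathbf k\mathbf k'\sigma\sigma'}c^\dagger_{\mathbf k\sigma}\boldsymbol\sigma_{\sigma\sigma'}c_{\mathbf k'\sigma'}/2$ denotes the spin density of the itinerant
electron band at the impurity site. To understand how the magnetic impurity affects
the low-temperature transport, we refer to Problem 5.5, where the sd-Hamiltonian is
explored in the framework of a diagrammatic perturbation theory in the spin interaction.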
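
The Pauli matrix identity invoked in part (c), $\boldsymbol\sigma_{\alpha\beta}\cdot\boldsymbol\sigma_{\gamma\delta} = 2\delta_{\alpha\delta}\delta_{\beta\gamma} - \delta_{\alpha\beta}\delta_{\gamma\delta}$, involves only sixteen index combinations and can be verified by brute force. The short sketch below does so with explicit matrices; the function names are illustrative.

```python
PAULI = (
    ((0, 1), (1, 0)),       # sigma_x
    ((0, -1j), (1j, 0)),    # sigma_y
    ((1, 0), (0, -1)),      # sigma_z
)

def pauli_dot(a, b, c, d):
    """Sum over the three Pauli matrices of sigma[a][b] * sigma[c][d]."""
    return sum(s[a][b] * s[c][d] for s in PAULI)

def identity_rhs(a, b, c, d):
    """2 delta_{ad} delta_{bc} - delta_{ab} delta_{cd}."""
    return 2 * (a == d) * (b == c) - (a == b) * (c == d)

def identity_holds():
    """Check the identity for all sixteen index combinations."""
    indices = (0, 1)
    return all(
        pauli_dot(a, b, c, d) == identity_rhs(a, b, c, d)
        for a in indices for b in indices for c in indices for d in indices
    )

if __name__ == "__main__":
    print(identity_holds())
```

Contracting the identity with $c^\dagger_{\mathbf k\alpha}c_{\mathbf k'\beta}d^\dagger_\gamma d_\delta$ is what converts the spin-exchange term into the $\mathbf s_{\mathbf k\mathbf k'}\cdot\mathbf S_d$ form.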



Answer: 



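(a) Since the diagonal elements $H_{mm}$ leave the occupation number fixed, they may be
identified with the diagonal elements of the microscopic Hamiltonian, i.e.

$$H_{00} = \sum_{\mathbf k\sigma}\epsilon_{\mathbf k}c^\dagger_{\mathbf k\sigma}c_{\mathbf k\sigma}, \qquad H_{11} = \sum_{\mathbf k\sigma}\epsilon_{\mathbf k}c^\dagger_{\mathbf k\sigma}c_{\mathbf k\sigma} + \epsilon_d, \qquad H_{22} = \sum_{\mathbf k\sigma}\epsilon_{\mathbf k}c^\dagger_{\mathbf k\sigma}c_{\mathbf k\sigma} + 2\epsilon_d + U.$$

The off-diagonal terms arise from the hybridization between the free electron states and
the impurity. Since the coupling involves only the transfer of single electrons, $H_{02} =
H_{20} = 0$ and

$$H_{10} = \sum_{\mathbf k\sigma}V_{\mathbf k}d^\dagger_\sigma(1 - n_{d\bar\sigma})c_{\mathbf k\sigma}, \qquad H_{21} = \sum_{\mathbf k\sigma}V_{\mathbf k}d^\dagger_\sigma n_{d\bar\sigma}c_{\mathbf k\sigma},$$

where $\bar\sigma = \uparrow$ for $\sigma = \downarrow$ and vice versa, $H_{01} = H^\dagger_{10}$ and $H_{12} = H^\dagger_{21}$.

(b) Using the fact that $H_{00}|\psi_0\rangle + H_{01}|\psi_1\rangle = E|\psi_0\rangle$, one may set $|\psi_0\rangle = (E - H_{00})^{-1}H_{01}|\psi_1\rangle$
and, similarly, $|\psi_2\rangle = (E - H_{22})^{-1}H_{21}|\psi_1\rangle$. Then, substituting into the equation for
$|\psi_1\rangle$, one obtains the required expression.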








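(c) Making use of the expressions from part (a), we have

$$H_{12}\frac{1}{E - H_{22}}H_{21} = \sum_{\mathbf k\mathbf k'\sigma\sigma'}V^*_{\mathbf k}V_{\mathbf k'}\, c^\dagger_{\mathbf k\sigma}n_{d\bar\sigma}d_\sigma\frac{1}{E - H_{22}}d^\dagger_{\sigma'}n_{d\bar\sigma'}c_{\mathbf k'\sigma'},$$

$$H_{10}\frac{1}{E - H_{00}}H_{01} = \sum_{\mathbf k\mathbf k'\sigma\sigma'}V_{\mathbf k}V^*_{\mathbf k'}\, d^\dagger_\sigma(1 - n_{d\bar\sigma})c_{\mathbf k\sigma}\frac{1}{E - H_{00}}c^\dagger_{\mathbf k'\sigma'}(1 - n_{d\bar\sigma'})d_{\sigma'}.$$

Then, substituting for $H_{22}$ and $H_{00}$ from (a), and commuting operators, we have

$$\frac{1}{E - H_{22}}d^\dagger_{\sigma'}n_{d\bar\sigma'}c_{\mathbf k'\sigma'} = -d^\dagger_{\sigma'}n_{d\bar\sigma'}c_{\mathbf k'\sigma'}\frac{1}{U + \epsilon_d - \epsilon_{\mathbf k'}}\left[1 - \frac{E - \epsilon_d - H_{00}}{U + \epsilon_d - \epsilon_{\mathbf k'}}\right]^{-1},$$

$$\frac{1}{E - H_{00}}c^\dagger_{\mathbf k'\sigma'}(1 - n_{d\bar\sigma'})d_{\sigma'} = -c^\dagger_{\mathbf k'\sigma'}(1 - n_{d\bar\sigma'})d_{\sigma'}\frac{1}{\epsilon_{\mathbf k'} - \epsilon_d}\left[1 - \frac{E - \epsilon_d - H_{00}}{\epsilon_{\mathbf k'} - \epsilon_d}\right]^{-1}.$$

Then expanding in large $U$ and $\epsilon_d$, to leading order we obtain

$$H_{12}\frac{1}{E - H_{22}}H_{21} + H_{10}\frac{1}{E - H_{00}}H_{01} \simeq -\sum_{\mathbf k\mathbf k'\sigma\sigma'}\left(V^*_{\mathbf k}V_{\mathbf k'}\frac{c^\dagger_{\mathbf k\sigma}n_{d\bar\sigma}d_\sigma d^\dagger_{\sigma'}n_{d\bar\sigma'}c_{\mathbf k'\sigma'}}{U + \epsilon_d - \epsilon_{\mathbf k'}} + V_{\mathbf k}V^*_{\mathbf k'}\frac{d^\dagger_\sigma(1 - n_{d\bar\sigma})c_{\mathbf k\sigma}c^\dagger_{\mathbf k'\sigma'}(1 - n_{d\bar\sigma'})d_{\sigma'}}{\epsilon_{\mathbf k'} - \epsilon_d}\right).$$

Finally, noting that this operator acts upon the singly-occupied subspace spanned by
$|\psi_1\rangle$, we see that the factors involving $n_{d\bar\sigma}$ are redundant and can be dropped. As a
result, swapping the momentum and spin indices in the second part of the expression,
we obtain the required expression.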
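
The elimination of $|\psi_0\rangle$ and $|\psi_2\rangle$ in part (b) is exact, and this can be illustrated on a toy model in which each occupancy sector is reduced to a single state. In the sketch below the blocks $H_{00}$, $H_{11}$, $H_{22}$ are plain numbers, $H_{01}$ and $H_{12}$ are real couplings, and $H_{02} = 0$; these numerical values are arbitrary assumptions and are not derived from the Anderson model. A root of the effective equation for $|\psi_1\rangle$ is found by bisection and then shown to be an eigenvalue of the full $3\times3$ matrix.

```python
def effective_residual(E, h00, h11, h22, v01, v12):
    """[H10 (E - H00)^-1 H01 + H11 + H12 (E - H22)^-1 H21] - E for 1x1 blocks."""
    return v01 ** 2 / (E - h00) + h11 + v12 ** 2 / (E - h22) - E

def determinant(E, h00, h11, h22, v01, v12):
    """det(H - E) for H = [[h00, v01, 0], [v01, h11, v12], [0, v12, h22]]."""
    return ((h00 - E) * ((h11 - E) * (h22 - E) - v12 ** 2)
            - v01 ** 2 * (h22 - E))

def lowest_root(h00, h11, h22, v01, v12, iterations=200):
    """Bisection for the root of the effective equation below min(h00, h22)."""
    upper = min(h00, h22) - 1e-9   # just below the nearest pole
    lower = upper - 100.0          # far below, where the residual is positive
    for _ in range(iterations):
        mid = 0.5 * (lower + upper)
        # The residual decreases monotonically between poles.
        if effective_residual(mid, h00, h11, h22, v01, v12) > 0:
            lower = mid
        else:
            upper = mid
    return 0.5 * (lower + upper)

if __name__ == "__main__":
    # Toy numbers: the singly occupied level lies lowest.
    params = (0.0, -1.0, 2.0, 0.3, 0.3)
    E = lowest_root(*params)
    print(E, determinant(E, *params))
```

The root lies slightly below the bare level $H_{11}$: virtual excursions into the empty and doubly occupied sectors lower the energy, which is the same mechanism that generates the exchange coupling in the full problem.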




3 

Feynman path integral 



The aim of this chapter is to introduce the concept of the Feynman path integral. As well as develop- 
ing the general construction scheme, particular emphasis is placed on establishing the interconnections 
between the quantum mechanical path integral, classical Hamiltonian mechanics, and classical statistical 
mechanics. The practice of path integration is discussed in the context of several pedagogical applica- 
tions. As well as the canonical examples of a quantum particle in a single and a double potential well, 
we discuss the generalization of the path integral scheme to tunneling of extended objects (quantum 
fields), dissipative and thermally assisted quantum tunneling, and the quantum mechanical spin. 



In this chapter we temporarily leave the arena of many-body physics and second quantization
and, at least superficially, return to single-particle quantum mechanics. By establishing
the path integral approach for ordinary quantum mechanics, we will set the stage for the
introduction of field integral methods for many-body theories explored in the next chapter.
We will see that the path integral not only represents a gateway to higher-dimensional
functional integral methods but, when viewed from an appropriate perspective, already
represents a field theoretical approach in its own right. Exploiting this connection, various
concepts of field theory, namely stationary phase analysis, the Euclidean formulation of field
theory, instanton techniques, and the role of topology in field theory, are introduced in this
chapter.



3.1 The path integral: general formalism 

Broadly speaking, there are two approaches to the formulation of quantum mechanics: the
“operator approach” based on the canonical quantization of physical observables and the
associated operator algebra, and the Feynman path integral.1 Whereas canonical quantization
is usually taught first in elementary courses on quantum mechanics, path integrals seem
to have acquired the reputation of being a sophisticated concept that is better reserved for


1 For a more extensive introduction to the Feynman path integral, one can refer to one of the many standard
texts including R. P. Feynman and A. R. Hibbs, Quantum Mechanics and Path Integrals (McGraw-Hill, 1965),
J. W. Negele and H. Orland, Quantum Many Particle Systems (Addison-Wesley, 1988), and L. S. Schulman,
Techniques and Applications of Path Integration (Wiley, 1981). Alternatively, one may turn to the original
paper, R. P. Feynman, Space-time approach to non-relativistic quantum mechanics, Rev. Mod. Phys. 20 (1948),
362-87. Historically, Feynman’s development of the path integral was motivated by earlier work by Dirac on
the connection between classical and quantum mechanics, P. A. M. Dirac, On the analogy between classical and
quantum mechanics, Rev. Mod. Phys. 17 (1945), 195-9.






advanced courses. Yet this treatment is hardly justified! In fact, the path integral formulation has
many advantages, most of which explicitly support an intuitive understanding of quantum mechanics.
Moreover, integrals - even the infinite-dimensional ones encountered below - are hardly more
abstract than infinite-dimensional linear operators. Further merits of the path integral include
the following:

> Whereas the classical limit is not always easy to retrieve within the canonical formulation 
of quantum mechanics, it constantly remains visible in the path integral approach. The 
latter makes explicit use of classical mechanics as a “platform” on which to construct a 
theory of quantum fluctuations. The classical solutions of Hamilton’s equation of motion 
always remain a central ingredient of the formalism.2

> Path integrals allow for an efficient formulation of non-perturbative approaches to the
solution of quantum mechanical problems. Examples include the “instanton” formulation
of quantum tunneling discussed below. The extension of such methods to continuum
theories has led to some of the most powerful concepts of quantum field theory.

> The Feynman path integral represents a prototype of the higher-dimensional field integrals 
to be introduced in the next chapter. However, even the basic “zero-dimensional” path 
integral is of relevance to applications in many-body physics. Very often, one encounters 
environments such as the superconductor, superfluid, or strongly correlated few-electron 
devices where a macroscopically large number of degrees of freedom “lock” to form a 
single collective variable. (For example, to a first approximation, the phase information 
carried by the order parameter field in moderately large superconducting grains can often 
be described in terms of a single phase degree of freedom, i.e. a “quantum particle” living 
on the unit circle.) Path integral techniques have proven ideally suited to the analysis of 
such systems. 

What then is the basic idea of the path integral approach? More than any other formula- 
tion of quantum mechanics, the path integral formalism is based on connections to classical 
mechanics. The variational approach employed in Chapter 1 relied on the fact that classi- 
cally allowed trajectories in configuration space extremize an action functional. A principal 
constraint to be imposed on any such trajectory is energy conservation. By contrast, quan- 
tum particles have a little bit more freedom than their classical counterparts. In particular, 
by the Uncertainty Principle, energy conservation can be violated by an amount $\Delta E$ over a
time $\sim \hbar/\Delta E$ (here, and throughout this chapter, we will write $\hbar$ explicitly for clarity). The connection



Richard P. Feynman 1918-88 

Nobel Laureate in Physics in 1965 
(with Sin-Itiro Tomonaga and
Julian Schwinger) for “fundamen- 
tal work in quantum electrody- 
namics, with far-reaching conse- 
quences for the physics of elemen- 
tary particles.” He was also well 
known for his unusual life style and for his popu- 
lar books and lectures on mathematics and physics. 
(Image © The Nobel Foundation.) 




2 For this reason, path integration has turned out to be an indispensable tool in fields such as quantum chaos
where the quantum manifestations of classically non-trivial behavior are investigated — for more details, see 
Section 3.3. 










to action principles of classical mechanics becomes particularly apparent in problems of 
quantum tunneling: a particle of energy E may tunnel through a potential barrier of height 
V > E. However, this process is penalized by a damping factor $\sim \exp(i\int_{\rm barrier} dx\, p/\hbar)$,
where $p = \sqrt{2m(E - V)}$, i.e. the exponent of the (imaginary) action associated with the
classically forbidden path. 

These observations motivate the idea of a new formulation of quantum propagation: could
it be that, as in classical mechanics, the quantum amplitude $A$ for propagation between
any two points in coordinate space is again controlled by the action functional - controlled
in a relaxed sense where not just a single extremal path $x_{\rm cl}(t)$, but an entire manifold
of neighboring paths contribute? More specifically, one might speculate that the quantum
amplitude is obtained as $A \sim \sum_{x(t)} \exp(iS[x]/\hbar)$, where $\sum_{x(t)}$ symbolically stands for a
summation over all paths compatible with the initial conditions of the problem, and $S$
denotes the classical action. Although, at this stage, no formal justification for the path
integral has been presented, with this ansatz some features of quantum mechanics would
obviously be borne out correctly. Specifically, in the classical limit ($\hbar \to 0$), the quantum
mechanical amplitude would become increasingly dominated by the contribution to the
sum from the classical path $x_{\rm cl}(t)$. This is because non-extremal configurations would be
weighted by a rapidly oscillating amplitude associated with the large phase $S/\hbar$ and would,
therefore, average to zero. 3 Secondly, quantum mechanical tunneling would be a natural
element of the theory; non-classical paths do contribute to the net amplitude, but at the cost
of a damping factor specified by the imaginary action (as in the traditional formulation).
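
The cancellation argument can be illustrated on a one-dimensional toy integral. The following Python sketch is not part of the construction in the text: the "action" f(x) = cosh(x) - 1, the Gaussian envelope g(x), the grid parameters, and all function names are illustrative assumptions. It compares a brute-force evaluation of the integral of g(x) exp(i f(x)/hbar) with the leading stationary-phase estimate, which retains only the neighborhood of the extremal point x = 0.

```python
import numpy as np

def oscillatory_integral(hbar, half_width=4.0, dx=1e-4):
    """Brute-force estimate of I(hbar) = ∫ dx g(x) exp(i f(x) / hbar)."""
    x = np.arange(-half_width, half_width + dx, dx)
    g = np.exp(-x**2)          # smooth envelope (illustrative choice)
    f = np.cosh(x) - 1.0       # toy "action": stationary at x = 0 with f''(0) = 1
    # Plain Riemann sum; the envelope is negligible at the ends of the grid.
    return np.sum(g * np.exp(1j * f / hbar)) * dx

def stationary_phase(hbar):
    """Leading estimate g(0) * sqrt(2 pi i hbar / f''(0)) * exp(i f(0) / hbar)."""
    return np.sqrt(2j * np.pi * hbar)

def relative_error(hbar):
    exact = oscillatory_integral(hbar)
    return abs(exact - stationary_phase(hbar)) / abs(exact)

if __name__ == "__main__":
    for hbar in (0.1, 0.03, 0.01):
        print(f"hbar = {hbar:5.2f}   relative error = {relative_error(hbar):.4f}")
```

For decreasing hbar the relative deviation is expected to shrink roughly in proportion to hbar, reflecting the growing dominance of the stationary point over the rapidly oscillating remainder.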

Fortunately, no fundamentally novel “picture” of quantum mechanics needs to be declared
to promote the idea of the path “integral” $\sum_{x(t)} \exp(iS[x]/\hbar)$ to a working theory. As we
will see in the next section, the new formulation can be developed from the established 
principles of canonical quantization. 



3.2 Construction of the path integral 

All information about an autonomous 4 quantum system is contained in its time evolution
operator. A formal integration of the time-dependent Schrödinger equation $i\hbar\partial_t|\Psi\rangle = \hat H|\Psi\rangle$
gives the time evolution operator

$$ |\Psi(t')\rangle = \hat U(t',t)|\Psi(t)\rangle, \qquad \hat U(t',t) = e^{-\frac{i}{\hbar}\hat H(t'-t)}\,\Theta(t'-t). $$ (3.1)

The operator $\hat U(t',t)$ describes dynamical evolution under the influence of the Hamiltonian
from a time $t$ to time $t'$. Causality implies that $t' > t$, as indicated by the step or Heaviside
$\Theta$-function. In the real space representation we can write

$$ \Psi(q',t') = \langle q'|\Psi(t')\rangle = \langle q'|\hat U(t',t)|\Psi(t)\rangle = \int dq\, U(q',t';q,t)\,\Psi(q,t), $$



3 More precisely, in the limit of small $\hbar$, the path sum can be evaluated by saddle-point methods, as detailed
below.

4 A system is classified as autonomous if its Hamiltonian does not explicitly depend on time. Actually the
construction of the path integral can be straightforwardly extended so as to include time-dependent problems. 
However, in order to keep the introductory discussion as simple as possible, here we assume time independence. 







where $U(q',t';q,t) = \langle q'|e^{-\frac{i}{\hbar}\hat H(t'-t)}|q\rangle\,\Theta(t'-t)$ defines the $(q',q)$-component of the time
evolution operator. As the matrix element expresses the probability amplitude for a particle
to propagate between points $q$ and $q'$ in a time $t'-t$, it is sometimes known as the
propagator of the theory.

The basic idea behind Feynman’s path integral approach is easy to formulate. Rather
than attacking the Schrödinger equation governing the time evolution for general times $t$,
one may first attempt to solve the much simpler problem of describing the time evolution
for infinitesimally small times $\Delta t$. In order to formulate this idea one must first divide the
time evolution into $N \gg 1$ time steps,

$$ e^{-i\hat Ht/\hbar} = \left[e^{-i\hat H\Delta t/\hbar}\right]^N, $$ (3.2)



where $\Delta t = t/N$. Albeit nothing more than a formal rewriting of Eq. (3.1), the representation
(3.2) has the advantage that the exponents of the factors $e^{-i\hat H\Delta t/\hbar}$ (or, rather, their expectation values)
are small. (More precisely, if $\Delta t$ is much smaller than the [reciprocal of the] eigenvalues of
the Hamiltonian in the regime of physical interest, the exponents are small in comparison
with unity and, as such, can be treated perturbatively.) A first simplification arising from
this fact is that the exponentials can be factorized into two pieces, each of which can be
readily diagonalized. To achieve this factorization, we make use of the identity

$$ e^{-i\hat H\Delta t/\hbar} = e^{-i\hat T\Delta t/\hbar}\,e^{-i\hat V\Delta t/\hbar} + O(\Delta t^2), $$



where the Hamiltonian $\hat H = \hat T + \hat V$ is the sum of a kinetic energy $\hat T = \hat p^2/2m$, and some
potential energy operator $\hat V$. 5 (The following analysis, restricted for simplicity to a
one-dimensional Hamiltonian, is easily generalized to arbitrary spatial dimension.) The advantage
of this factorization is that the eigenstates of each factor $e^{-i\hat T\Delta t/\hbar}$ and $e^{-i\hat V\Delta t/\hbar}$ are
known independently. To exploit this fact we consider the time evolution operator factorized
as a product,



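$$ \langle q_f|\left[e^{-i\hat H\Delta t/\hbar}\right]^N|q_i\rangle \simeq \langle q_f|\wedge e^{-i\hat T\Delta t/\hbar}e^{-i\hat V\Delta t/\hbar}\wedge\cdots\wedge e^{-i\hat T\Delta t/\hbar}e^{-i\hat V\Delta t/\hbar}|q_i\rangle, $$ (3.3)

and insert at each of the positions indicated by the symbol “$\wedge$” the resolution of identity

$$ \mathrm{id} = \int dq_n \int dp_n\, |q_n\rangle\langle q_n|p_n\rangle\langle p_n|. $$ (3.4)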



Here $|q_n\rangle$ and $|p_n\rangle$ represent a complete set of position and momentum eigenstates respectively,
and $n = 1, \ldots, N$ serves as an index keeping track of the time steps at which the
unit operator is inserted. The rationale behind the particular choice (3.4) is clear. The
unit operator is arranged in such a way that both $\hat T$ and $\hat V$ act on the corresponding



5 Although this ansatz covers a wide class of quantum problems, many applications (e.g. Hamiltonians involving
spin or magnetic fields) do not fit into this framework. For a detailed exposition covering its realm of applicability,
we refer to the specialist literature such as, e.g., Schulman (footnote 1).








Figure 3.1 (a) Visualization of a set of phase space points contributing to the discrete time con- 
figuration integral (3.5). (b) In the continuum limit, the set of points becomes a smooth curve. 



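eigenstates. Inserting Eq. (3.4) into (3.3), and making use of the identity $\langle q|p\rangle = \langle p|q\rangle^* =
e^{iqp/\hbar}/(2\pi\hbar)^{1/2}$, one obtains

$$ \langle q_f|e^{-i\hat Ht/\hbar}|q_i\rangle \simeq \int \prod_{\substack{n=1\\ q_N=q_f,\,q_0=q_i}}^{N-1} dq_n \prod_{n=1}^{N} \frac{dp_n}{2\pi\hbar}\, \exp\left[-\frac{i\Delta t}{\hbar}\sum_{n=0}^{N-1}\left(V(q_n) + T(p_{n+1}) - p_{n+1}\frac{q_{n+1}-q_n}{\Delta t}\right)\right]. $$ (3.5)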



Thus, the matrix element of the time evolution operator has been expressed as a $(2N-1)$-dimensional
integral over eigenvalues. Up to corrections of higher order in $V\Delta t/\hbar$ and
$T\Delta t/\hbar$, the expression (3.5) is exact. At each “time step” $t_n = n\Delta t$, $n = 1, \ldots, N$, we are
integrating over a pair of coordinates $x_n \equiv (q_n, p_n)$ parameterizing the classical phase
space. Taken together, the points $\{x_n\}$ form an $N$-point discretization of a path in this
space (see Fig. 3.1).
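
The quality of the factorization underlying Eq. (3.3) can be probed numerically. The sketch below is an illustration only, under assumptions not made in the text: units with hbar = m = 1, a harmonic potential, a finite-difference grid in place of the continuum, and hypothetical helper names. It propagates a Gaussian state with N factorized steps and compares the result with the exact evolution; since each step carries an error of order Δt², the accumulated error over a fixed time t should scale as Δt = t/N.

```python
import numpy as np

def expm_hermitian(h, factor):
    """Return exp(factor * h) for a Hermitian matrix h via its eigendecomposition."""
    evals, evecs = np.linalg.eigh(h)
    return (evecs * np.exp(factor * evals)) @ evecs.conj().T

def trotter_state_errors(steps=(50, 100, 200), num_sites=80, box=12.0, t=1.0):
    """Norm of (factorized evolution - exact evolution) applied to a Gaussian state."""
    q = np.linspace(-box / 2, box / 2, num_sites)
    dq = q[1] - q[0]
    off = np.ones(num_sites - 1)
    laplacian = (np.diag(off, 1) + np.diag(off, -1) - 2.0 * np.eye(num_sites)) / dq**2
    T = -0.5 * laplacian              # kinetic energy p^2 / 2m on the grid
    V = np.diag(0.5 * q**2)           # harmonic potential (illustrative choice)
    psi0 = np.exp(-0.5 * q**2)
    psi0 = psi0 / np.linalg.norm(psi0)
    exact = expm_hermitian(T + V, -1j * t) @ psi0
    errors = []
    for n in steps:
        dt = t / n
        one_step = expm_hermitian(T, -1j * dt) @ expm_hermitian(V, -1j * dt)
        psi = np.linalg.matrix_power(one_step, n) @ psi0
        errors.append(float(np.linalg.norm(psi - exact)))
    return errors

if __name__ == "__main__":
    for n, err in zip((50, 100, 200), trotter_state_errors()):
        print(f"N = {n:4d}   error = {err:.3e}")
```

Doubling N should roughly halve the error, consistent with corrections to Eq. (3.5) that vanish in the limit of infinitely many time slices.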

To make further progress, we need to develop some intuition for the behavior of the 
integral (3.5). We first notice that rapid fluctuations of the integration arguments $x_n$ as
a function of the index $n$ are strongly inhibited by the structure of the integrand. When
taken together, contributions for which $(q_{n+1} - q_n)p_{n+1} > O(\hbar)$ (i.e. when the phase of the
exponential exceeds $2\pi$) tend to lead to a “random phase cancellation.” In the language of
wave mechanics, the superposition of partial waves of erratically different phases destruc- 
tively interferes. The smooth variation of the paths that contribute significantly motivates 
the application of a continuum limit analogous to that employed in Chapter 1. 

To be specific, sending $N \to \infty$ whilst keeping $t = N\Delta t$ fixed, the formerly discrete set
$t_n = n\Delta t$, $n = 1, \ldots, N$, becomes dense on the time interval $[0, t]$, and the set of phase space
points $\{x_n\}$ becomes a continuous curve $x(t)$. In the same limit,

$$ \Delta t \sum_{n=0}^{N-1} \to \int_0^t dt', \qquad \frac{q_{n+1}-q_n}{\Delta t} \to \partial_{t'}q\,|_{t'=t_n} \equiv \dot q\,|_{t'=t_n}, $$

while $[V(q_n) + T(p_{n+1})] \to [T(p|_{t'=t_n}) + V(q|_{t'=t_n})] \equiv H(x|_{t'=t_n})$ denotes the classical
Hamiltonian. In the limit $N \to \infty$, the fact that kinetic and potential energies are evaluated







at neighboring time slices, $n$ and $n + 1$, becomes irrelevant. 6 Finally,

$$ \lim_{N\to\infty} \int \prod_{\substack{n=1\\ q_N=q_f,\,q_0=q_i}}^{N-1} dq_n \prod_{n=1}^{N} \frac{dp_n}{2\pi\hbar} \equiv \int_{\substack{q(t)=q_f\\ q(0)=q_i}} Dx $$

defines the integration measure of the integral.



INFO Integrals extending over infinite-dimensional integration measures like $D(q,p)$ are generally
called functional integrals (recall our discussion of functionals in Chapter 1). The question
of the way functional integration can be rigorously defined is far from innocent and represents 
a subject of current, and partly controversial, mathematical research. In this book - as in most 
applications in physics - we take a pragmatic point of view and deal with the infinite-dimensional 
integration naively unless mathematical problems arise (which actually will not be the case!). 



Then, applying these conventions to Eq. (3.5), one finally obtains

$$ \langle q_f|e^{-i\hat Ht/\hbar}|q_i\rangle = \int_{\substack{q(t)=q_f\\ q(0)=q_i}} Dx\, \exp\left[\frac{i}{\hbar}\int_0^t dt'\,\big(p\dot q - H(p,q)\big)\right]. $$ (3.6)



Equation (3.6) represents the Hamiltonian formulation of the path integral. The
integration extends over all possible paths through the classical phase space of the system
which begin and end at the configuration points $q_i$ and $q_f$ respectively (cf. Fig. 3.1).
The contribution of each path is weighted by its Hamiltonian action.



Remembering the connection of the Hamiltonian to the Lagrangian through the Legendre
transform, $H(p,q) = p\dot q - L(q,\dot q)$, the classical action of a trajectory $t \mapsto q(t)$ is given by
$S[p,q] = \int_0^t dt'\, L(q,\dot q) = \int_0^t dt'\,[p\dot q - H(p,q)]$.

Before we turn to the discussion of the path integral (3.6), it is useful to recast the integral in 
an alternative form which will be both convenient in applications and physically instructive. 
The search for an alternative formulation is motivated by the resemblance of Eq. (3.6) to 
the Hamiltonian formulation of classical mechanics. Given that, classically, Hamiltonian 
and Lagrangian mechanics can be equally employed to describe dynamical evolution, it is 
natural to seek a Lagrangian analog of Eq. (3.6). Focusing on Hamiltonians for which the 
kinetic energy T(p) is quadratic in p, the Lagrangian form of the path integral can indeed 
be inferred from Eq. (3.6) by straightforward Gaussian integration. 



6 To see this formally, one may Taylor expand $T(p_{n+1}) = T(p(t' + \Delta t))|_{t'=n\Delta t}$ around $p(t')$. For smooth $p(t')$, all
but the zeroth-order contribution $T(p(t'))$ scale with powers of $\Delta t$, thereby becoming irrelevant. Note, however,
that all of these arguments are based on the assertion that the dominant contributions to the path integral are
smooth in the sense $q_{n+1} - q_n \sim O(\Delta t)$. A closer inspection, however, shows that in fact $q_{n+1} - q_n \sim O(\sqrt{\Delta t})$ (see
Schulman, footnote 1). In some cases, the most prominent one being the quantum mechanics of a particle in a magnetic
field, the lowered power of $\Delta t$ spoils the naive form of the continuity argument above, and more care must be
applied in taking the continuum limit. In cases where a “new” path integral description of a quantum mechanical
problem is developed, it is imperative to delay taking the continuum limit until the fluctuation behavior of the
discrete integral across individual time slices has been thoroughly examined.








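To make this point clear, let us rewrite the integral in a way that emphasizes its dependence
on the momentum variable $p$:

$$ \langle q_f|e^{-i\hat Ht/\hbar}|q_i\rangle = \int_{\substack{q(t)=q_f\\ q(0)=q_i}} Dq\; e^{-\frac{i}{\hbar}\int_0^t dt'\,V(q)} \int Dp\; e^{-\frac{i}{\hbar}\int_0^t dt'\left(\frac{p^2}{2m} - p\dot q\right)}. $$ (3.7)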



The exponent is quadratic in $p$ which means that we are dealing with the continuum generalization
of a Gaussian integral. Carrying out the integration by means of Eq. (3.13) below,
one obtains

$$ \langle q_f|e^{-i\hat Ht/\hbar}|q_i\rangle = \int_{\substack{q(t)=q_f\\ q(0)=q_i}} Dq\, \exp\left[\frac{i}{\hbar}\int_0^t dt'\,L(q,\dot q)\right], $$ (3.8)

where $Dq = \lim_{N\to\infty} \left(\frac{Nm}{it2\pi\hbar}\right)^{N/2} \prod_{n=1}^{N-1} dq_n$ denotes the functional measure of the remaining
$q$-integration, and $L(q,\dot q) = m\dot q^2/2 - V(q)$ represents the classical Lagrangian. Strictly
speaking, the (finite-dimensional) integral formula (3.13) is not directly applicable to the
infinite-dimensional Gaussian integral (3.7). This, however, does not represent a substantial
problem as we can always rediscretize the integral (3.7), apply Eq. (3.13), and reinstate the
continuum limit after integration (exercise).
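
The rediscretized momentum integration can be sketched as follows. This is one possible route through the exercise rather than a prescribed solution; the shorthand $\dot q_n \equiv (q_{n+1}-q_n)/\Delta t$ and the regularization $a \to a + 0^+$ of the purely imaginary Gaussian coefficient are assumptions of the sketch. Each momentum integral in Eq. (3.5) is of the one-dimensional Gaussian form (3.10) and can be carried out separately:

```latex
% Single time slice, with \Delta t = t/N and \dot q_n \equiv (q_{n+1}-q_n)/\Delta t.
% Eq. (3.10) is used with a = i\Delta t/(m\hbar) and b = i\Delta t\,\dot q_n/\hbar;
% since Re a = 0, convergence is understood via a \to a + 0^+ (assumption of this sketch).
\int \frac{dp_{n+1}}{2\pi\hbar}\,
  \exp\left[-\frac{i\Delta t}{\hbar}
    \left(\frac{p_{n+1}^2}{2m} - p_{n+1}\dot q_n\right)\right]
  = \left(\frac{m}{2\pi i\hbar\,\Delta t}\right)^{1/2}
    \exp\left[\frac{i\Delta t}{\hbar}\,\frac{m\dot q_n^2}{2}\right].

% Product over the N momentum integrals, combined with the potential term of Eq. (3.5):
\langle q_f|e^{-i\hat H t/\hbar}|q_i\rangle \simeq
  \left(\frac{Nm}{2\pi i\hbar\,t}\right)^{N/2}
  \int \prod_{n=1}^{N-1} dq_n\,
  \exp\left[\frac{i}{\hbar}\,\Delta t \sum_{n=0}^{N-1}
    \left(\frac{m\dot q_n^2}{2} - V(q_n)\right)\right].
```

In the limit $N \to \infty$ the sum in the exponent turns into the time integral of the Lagrangian $m\dot q^2/2 - V(q)$, while the $N$-dependent prefactor is absorbed into the measure of the $q$-integration.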

Together, Eqs. (3.6) and (3.8) represent the central results of this section. A quantum
mechanical transition amplitude has been expressed in terms of an infinite-dimensional
integral extending over paths through phase space, Eq. (3.6), or coordinate space, Eq. (3.8).
All paths begin (end) at the initial (final) coordinate of the matrix element. Each path is
weighted by its classical action. Notice in particular that the quantum transition amplitude
has been cast in a form that does not contain quantum mechanical operators. Nonetheless,
quantum mechanics is still fully present! The point is that the integration extends over all
paths and not just the subset of solutions of the classical equations of motion. (The
distinguished role classical paths play in the path integral will be discussed below in
Section 3.2.) The two forms of the path integral, Eq. (3.6) and Eq. (3.8), represent the formal
implementation of the “alternative picture” of quantum mechanics proposed heuristically at
the beginning of the chapter.

Gaussian integration: Apart from a few rare exceptions, all integrals encountered in 
this book will be of Gaussian form. In most cases the dimension of the integrals will be large if 
not infinite. Yet, after a bit of practice, it will become clear that high-dimensional representatives 
of Gaussian integrals are no more difficult to handle than their one-dimensional counterparts. 



Johann Carl Friedrich Gauss 
1777-1855 

Worked in a wide variety of 
fields in both mathematics and 
physics including number the- 
ory, analysis, differential geome- 
try, geodesy, magnetism, astron- 
omy, and optics. As well as sev- 
eral books, Gauss published a number of memoirs 
(reports of his experiences), mainly in the journal 
of the Royal Society of Göttingen. However, in general,
he was unwilling to publish anything that could
be regarded as controversial and, as a result, some 
of his most brilliant work was found only after his 
death. 









Therefore, considering the important role played by Gaussian integration in field theory, we will 
here derive the principal formulae once and for all. Our starting point is the one-dimensional 
integral (both real and complex). The ideas underlying the proofs of the one-dimensional formulae 
will provide the key to the derivation of more complex functional identities that will be used 
liberally throughout the remainder of the text. 



One-dimensional Gaussian integral: The basic ancestor of all Gaussian integrals is the
identity

$$ \int_{-\infty}^{\infty} dx\, e^{-\frac{a}{2}x^2} = \sqrt{\frac{2\pi}{a}}, \qquad \mathrm{Re}\,a > 0. $$ (3.9)



In the following we will need various generalizations of Eq. (3.9). Firstly, we have
$\int_{-\infty}^{\infty} dx\, e^{-ax^2/2}x^2 = \sqrt{2\pi/a^3}$, a result established either by substituting $a \to a + \epsilon$ in Eq. (3.9) and expanding both
the left and the right side of the equation to leading order in $\epsilon$, or by differentiating Eq. (3.9).
Often one encounters integrals where the exponent is not purely quadratic from the outset but
rather contains both quadratic and linear pieces. The generalization of Eq. (3.9) to this case
reads

$$ \int_{-\infty}^{\infty} dx\, e^{-\frac{a}{2}x^2 + bx} = \sqrt{\frac{2\pi}{a}}\, e^{\frac{b^2}{2a}}. $$ (3.10)



To prove this identity, one simply eliminates the linear term by means of the change of variables
$x \to x + b/a$ which transforms the exponent $-ax^2/2 + bx \to -ax^2/2 + b^2/2a$. The constant
factor scales out and we are left with Eq. (3.9). Note that Eq. (3.10) holds even for complex
b. The reason is that as a result of shifting the integration contour into the complex plane no 
singularities are encountered, i.e. the integral remains invariant. 
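
As a quick plausibility check of the one-dimensional identities, including the extension to complex parameters, one may compare a direct numerical quadrature with the closed form of the Gaussian integral with a linear term. The sketch below is illustrative only; the function names, grid parameters, and sample values of a and b are assumptions.

```python
import numpy as np

def gaussian_integral(a, b=0.0, half_width=20.0, num=40001):
    """Riemann-sum estimate of ∫ dx exp(-a x^2 / 2 + b x) for Re a > 0."""
    x = np.linspace(-half_width, half_width, num)
    dx = x[1] - x[0]
    return np.sum(np.exp(-0.5 * a * x**2 + b * x)) * dx

def closed_form(a, b=0.0):
    """Closed form sqrt(2 pi / a) * exp(b^2 / (2 a)); principal branch of the root."""
    return np.sqrt(2 * np.pi / a) * np.exp(b**2 / (2 * a))

if __name__ == "__main__":
    for a, b in [(2.0, 0.0), (1.5, 0.7 + 0.4j), (1.0 + 2.0j, 0.3)]:
        print(a, b, gaussian_integral(a, b), closed_form(a, b))
```

The second sample uses a complex b, the situation covered by the contour-shift argument; the third uses a complex a with positive real part.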

Later, we will be concerned with the generalization of the Gaussian integral to complex arguments.
The extension of Eq. (3.9) to this case reads

$$ \int d(\bar z, z)\, e^{-\bar z w z} = \frac{\pi}{w}, \qquad \mathrm{Re}\,w > 0, $$

where $\bar z$ represents the complex conjugate of $z$. Here, $\int d(\bar z, z) \equiv \int_{-\infty}^{\infty} dx\,dy$ represents the
independent integration over the real and imaginary parts of $z = x + iy$. The identity is easy to
prove: owing to the fact that $\bar z z = x^2 + y^2$, the integral factorizes into two pieces, each of which is
equivalent to Eq. (3.9) with $a = 2w$. Similarly, it may be checked that the complex generalization
of Eq. (3.10) is given by

$$ \int d(\bar z, z)\, e^{-\bar z w z + \bar u z + \bar z v} = \frac{\pi}{w}\, e^{\frac{\bar u v}{w}}, \qquad \mathrm{Re}\,w > 0. $$ (3.11)

More importantly, $\bar u$ and $v$ may be independent complex numbers; they need not be related to
each other by complex conjugation (exercise).

Gaussian integration in more than one dimension: All of the integrals above have higher-dimensional
counterparts. Although the real and complex versions of the $N$-dimensional integral
formulae can be derived in a perfectly analogous manner, it is better to discuss them separately 
in order not to confuse the notation. 



(a) Real case: The multi-dimensional generalization of the prototype integral (3.9) reads

$$ \int d\mathbf{v}\, e^{-\frac{1}{2}\mathbf{v}^T\mathbf{A}\mathbf{v}} = (2\pi)^{N/2} \det\mathbf{A}^{-1/2}, $$ (3.12)








where $\mathbf{A}$ is a positive definite real symmetric $N$-dimensional matrix and $\mathbf{v}$ is an $N$-component
real vector. The proof makes use of the fact that $\mathbf{A}$ (by virtue of being symmetric) can be
diagonalized by orthogonal transformation, $\mathbf{A} = \mathbf{O}^T\mathbf{D}\mathbf{O}$, where the matrix $\mathbf{O}$ is orthogonal,
and all elements of the diagonal matrix $\mathbf{D}$ are positive. The matrix $\mathbf{O}$ can be absorbed
into the integration vector by means of the variable transformation, $\mathbf{v} \to \mathbf{O}\mathbf{v}$, which has
unit Jacobian, $|\det\mathbf{O}| = 1$. As a result, we are left with a Gaussian integral with exponent
$-\mathbf{v}^T\mathbf{D}\mathbf{v}/2$. Due to the diagonality of $\mathbf{D}$, the integral factorizes into $N$ independent Gaussian
integrals, each of which contributes a factor $\sqrt{2\pi/d_i}$, where $d_i$, $i = 1, \ldots, N$, is the $i$th entry
of the matrix $\mathbf{D}$. Noting that $\prod_{i=1}^{N} d_i = \det\mathbf{D} = \det\mathbf{A}$, Eq. (3.12) is derived.

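The multi-dimensional generalization of Eq. (3.10) reads

$$ \int d\mathbf{v}\, e^{-\frac{1}{2}\mathbf{v}^T\mathbf{A}\mathbf{v} + \mathbf{j}^T\cdot\mathbf{v}} = (2\pi)^{N/2} \det\mathbf{A}^{-1/2} \exp\left[\frac{1}{2}\mathbf{j}^T\mathbf{A}^{-1}\mathbf{j}\right], $$ (3.13)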



where $\mathbf{j}$ is an arbitrary $N$-component vector. Equation (3.13) is proven by analogy with
Eq. (3.10), i.e. by shifting the integration vector according to $\mathbf{v} \to \mathbf{v} + \mathbf{A}^{-1}\mathbf{j}$, which does not
change the value of the integral but removes the linear term from the exponent,
$-\frac{1}{2}\mathbf{v}^T\mathbf{A}\mathbf{v} + \mathbf{j}^T\cdot\mathbf{v} \to -\frac{1}{2}\mathbf{v}^T\mathbf{A}\mathbf{v} + \frac{1}{2}\mathbf{j}^T\mathbf{A}^{-1}\mathbf{j}$. The resulting integral is of the type (3.12), and we arrive at
Eq. (3.13).
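
For small N, Eq. (3.13) can be checked by brute force. The following sketch is an illustration restricted to N = 2; the sample matrix, source vector, grid parameters, and function names are assumptions. It evaluates the left-hand side on a two-dimensional grid and compares it with the determinant formula on the right-hand side.

```python
import numpy as np

def gaussian_integral_2d(A, j, half_width=12.0, num=601):
    """Grid estimate of ∫ d^2v exp(-v^T A v / 2 + j^T v) for a symmetric 2x2 matrix A."""
    x = np.linspace(-half_width, half_width, num)
    dx = x[1] - x[0]
    v1, v2 = np.meshgrid(x, x, indexing="ij")
    quadratic = A[0, 0] * v1**2 + 2.0 * A[0, 1] * v1 * v2 + A[1, 1] * v2**2
    exponent = -0.5 * quadratic + j[0] * v1 + j[1] * v2
    return np.sum(np.exp(exponent)) * dx**2

def closed_form(A, j):
    """Right-hand side of Eq. (3.13): (2 pi)^(N/2) det(A)^(-1/2) exp(j^T A^{-1} j / 2)."""
    n = len(j)
    prefactor = (2 * np.pi) ** (n / 2) / np.sqrt(np.linalg.det(A))
    return prefactor * np.exp(0.5 * j @ np.linalg.solve(A, j))

if __name__ == "__main__":
    A = np.array([[2.0, 0.5], [0.5, 1.0]])   # positive definite, symmetric (sample values)
    j = np.array([0.3, -0.2])
    print(gaussian_integral_2d(A, j), closed_form(A, j))
```

The grid evaluation scales exponentially with N and serves only as a check; the practical value of Eq. (3.13) lies in replacing such integrals by a determinant and a matrix inverse.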

The integral (3.13) not only is of importance in its own right, but also serves as a “generator”
of other useful integral identities. Applying the differentiation operation $\partial^2_{j_m j_n}|_{\mathbf{j}=0}$ to the
left- and the right-hand side of Eq. (3.13), one obtains the identity 7 $\int d\mathbf{v}\, e^{-\frac{1}{2}\mathbf{v}^T\mathbf{A}\mathbf{v}}\, v_m v_n =
(2\pi)^{N/2} \det\mathbf{A}^{-1/2} A^{-1}_{mn}$. This result can be more compactly formulated as

$$ \langle v_m v_n \rangle = A^{-1}_{mn}, $$ (3.14)



where we have introduced the shorthand notation

$$ \langle \cdots \rangle \equiv (2\pi)^{-N/2} \det\mathbf{A}^{1/2} \int d\mathbf{v}\, e^{-\frac{1}{2}\mathbf{v}^T\mathbf{A}\mathbf{v}} (\cdots), $$ (3.15)



suggesting an interpretation of the Gaussian weight as a probability distribution. 

Indeed, the differentiation operation leading to Eq. (3.14) can be iterated. Differentiating four
times, one obtains $\langle v_m v_n v_q v_p \rangle = A^{-1}_{mn}A^{-1}_{qp} + A^{-1}_{mq}A^{-1}_{np} + A^{-1}_{mp}A^{-1}_{nq}$. One way of memorizing the
structure of this - important - identity is that the Gaussian “expectation” value $\langle v_m v_n v_p v_q \rangle$
is given by all “pairings” of type (3.14) that can be formed from the four components $v_m$. This
rule generalizes to expectation values of arbitrary order: $2n$-fold differentiation of Eq. (3.13)
yields

$$ \langle v_{i_1} v_{i_2} \cdots v_{i_{2n}} \rangle = \sum_{\text{pairings of } \{i_1,\ldots,i_{2n}\}} A^{-1}_{i_{k_1} i_{k_2}} \cdots A^{-1}_{i_{k_{2n-1}} i_{k_{2n}}}. $$ (3.16)

This result is the mathematical identity underlying Wick’s theorem (for real bosonic fields),
to be discussed in more physical terms below.
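
The pairing rule lends itself to a direct implementation. The sketch below is an illustration with hypothetical function names: it enumerates all pairings of a list of indices and sums the corresponding products of matrix elements of the inverse matrix. In the usage part, the result is compared with a Monte Carlo average over samples drawn from the Gaussian weight with covariance equal to the inverse matrix; the sample matrix and the sample size are arbitrary choices.

```python
import numpy as np

def pairings(items):
    """Yield all ways of splitting a list of even length into unordered pairs."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for k, partner in enumerate(rest):
        remaining = rest[:k] + rest[k + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail

def wick_moment(A_inv, indices):
    """Gaussian average <v_{i_1} ... v_{i_2n}> as a sum over pairings, cf. Eq. (3.16)."""
    if len(indices) % 2:
        return 0.0                      # odd moments of a centered Gaussian vanish
    total = 0.0
    for pairing in pairings(list(indices)):
        term = 1.0
        for m, n in pairing:
            term *= A_inv[m, n]
        total += term
    return total

if __name__ == "__main__":
    A = np.array([[2.0, 0.5], [0.5, 1.0]])
    A_inv = np.linalg.inv(A)
    rng = np.random.default_rng(0)
    samples = rng.multivariate_normal(np.zeros(2), A_inv, size=200_000)
    sampled = np.mean(samples[:, 0] ** 2 * samples[:, 1] ** 2)
    print("pairing sum:", wick_moment(A_inv, [0, 0, 1, 1]), " sampled:", sampled)
```

The number of pairings of 2n indices is (2n - 1)!!, so the explicit sum is practical only for low orders.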



7 Note that the notation $A^{-1}_{mn}$ refers to the $mn$-element of the matrix $\mathbf{A}^{-1}$.









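(b) Complex case: The results above are straightforwardly extended to multi-dimensional complex
Gaussian integrals. The complex version of Eq. (3.12) is given by

$$ \int d(\mathbf{v}^\dagger, \mathbf{v})\, e^{-\mathbf{v}^\dagger\mathbf{A}\mathbf{v}} = \pi^N \det\mathbf{A}^{-1}, $$ (3.17)

where $\mathbf{v}$ is a complex $N$-component vector, $d(\mathbf{v}^\dagger, \mathbf{v}) \equiv \prod_{i=1}^{N} d\,\mathrm{Re}\,v_i\, d\,\mathrm{Im}\,v_i$, and $\mathbf{A}$ is a
complex matrix with positive definite Hermitian part. (Remember that every matrix can be
decomposed into a Hermitian and an anti-Hermitian component, $\mathbf{A} = \frac{1}{2}(\mathbf{A} + \mathbf{A}^\dagger) + \frac{1}{2}(\mathbf{A} - \mathbf{A}^\dagger)$.)
For Hermitian $\mathbf{A}$, the proof of Eq. (3.17) is analogous to that of Eq. (3.12), i.e. $\mathbf{A}$ is
unitarily diagonalizable, $\mathbf{A} = \mathbf{U}^\dagger\mathbf{D}\mathbf{U}$ with $\mathbf{D}$ diagonal, the matrices $\mathbf{U}$ can be absorbed into $\mathbf{v}$, the resulting
integral factorizes, etc. For non-Hermitian $\mathbf{A}$ the proof is more elaborate, if unedifying, and
we refer to the literature for details. The generalization of Eq. (3.17) to exponents with linear
contributions reads

$$ \int d(\mathbf{v}^\dagger, \mathbf{v})\, \exp\left[-\mathbf{v}^\dagger\mathbf{A}\mathbf{v} + \mathbf{w}^\dagger\cdot\mathbf{v} + \mathbf{v}^\dagger\cdot\mathbf{w}'\right] = \pi^N \det\mathbf{A}^{-1} \exp\left[\mathbf{w}^\dagger\mathbf{A}^{-1}\mathbf{w}'\right]. $$ (3.18)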



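Note that $\mathbf{w}$ and $\mathbf{w}'$ may be independent complex vectors. The proof of this identity mirrors
that of Eq. (3.13), i.e. by effecting the shift $\mathbf{v}^\dagger \to \mathbf{v}^\dagger + \mathbf{w}^\dagger\mathbf{A}^{-1}$, $\mathbf{v} \to \mathbf{v} + \mathbf{A}^{-1}\mathbf{w}'$. 8 As with Eq. (3.13),
Eq. (3.18) may also serve as a generator of integral identities. Differentiating the integral
twice according to $\partial^2_{w'_m, \bar w_n}|_{\mathbf{w}=\mathbf{w}'=0}$ gives

$$ \langle \bar v_m v_n \rangle = A^{-1}_{nm}, $$

where $\langle \cdots \rangle \equiv \pi^{-N} \det\mathbf{A} \int d(\mathbf{v}^\dagger, \mathbf{v})\, e^{-\mathbf{v}^\dagger\mathbf{A}\mathbf{v}} (\cdots)$. The iteration to more than two derivatives
gives $\langle \bar v_n \bar v_m v_p v_q \rangle = A^{-1}_{pm}A^{-1}_{qn} + A^{-1}_{pn}A^{-1}_{qm}$ and, eventually,

$$ \langle \bar v_{i_1} \bar v_{i_2} \cdots \bar v_{i_n} v_{j_1} v_{j_2} \cdots v_{j_n} \rangle = \sum_P A^{-1}_{j_1 i_{P1}} \cdots A^{-1}_{j_n i_{Pn}}, $$

where $\sum_P$ stands for the sum over all permutations of $n$ integers.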



Gaussian functional integration: With this preparation, we are in a position to investigate 
the main practice of quantum and statistical field theory - the method of Gaussian functional 
integration. Turning to Eq. (3.13), let us suppose that the components of the vector $\mathbf{v}$ parameterize
the weight of a real scalar field on the sites of a one-dimensional lattice. In the continuum
limit, the set $\{v_i\}$ translates to a function $v(x)$, and the matrix $A_{ij}$ is replaced by an operator
kernel or propagator $A(x,x')$. In this limit, the natural generalization of Eq. (3.13) is

$$ \int Dv(x)\, \exp\left[-\frac{1}{2}\int dx\,dx'\, v(x)A(x,x')v(x') + \int dx\, j(x)v(x)\right] \propto (\det A)^{-1/2} \exp\left[\frac{1}{2}\int dx\,dx'\, j(x)A^{-1}(x,x')j(x')\right], $$ (3.19)



8 For an explanation of why $\mathbf{v}$ and $\mathbf{v}^\dagger$ may be shifted independently of each other, cf. the analyticity remarks
made in connection with Eq. (3.11).









where the inverse kernel $A^{-1}(x,x')$ satisfies the equation

$$ \int dx'\, A(x,x')A^{-1}(x',x'') = \delta(x - x''), $$ (3.20)

i.e. $A^{-1}(x,x')$ can be interpreted as the Green function of the operator $A(x,x')$. The notation
$Dv(x)$ is used to denote the measure of the functional integral. Although the constant of proportionality,
$(2\pi)^{N/2}$, left out of Eq. (3.19) is formally divergent in the thermodynamic limit $N \to \infty$,
it does not affect averages that are obtained from derivatives of such integrals. For example, for
Gaussian distributed functions, Eq. (3.14) has the generalization

$$ \langle v(x)v(x') \rangle = A^{-1}(x,x'). $$

Accordingly, Eq. (3.16) assumes the form

$$ \langle v(x_1)v(x_2) \cdots v(x_{2n}) \rangle = \sum_{\text{pairings of } \{x_1,\ldots,x_{2n}\}} A^{-1}(x_{k_1},x_{k_2}) \cdots A^{-1}(x_{k_{2n-1}},x_{k_{2n}}). $$ (3.21)



The generalization of the other Gaussian averaging formulae discussed above should be obvious. 

To make sense of Eq. (3.19) one must interpret the meaning of the determinant, det A. When 
the variables entering the Gaussian integral were discrete, the integral simply represented the 
determinant of the (real symmetric) matrix. In the present case, one must interpret A as a 
Hermitian operator having an infinite set of eigenvalues. The determinant simply represents the 
product over this infinite set (see, e.g., Section 3.3). 
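
The passage from matrices to operator kernels can be made concrete on a lattice. The sketch below is an illustration under stated assumptions: the operator A = -d²/dx² + kappa², periodic boundary conditions, the lattice parameters, and the function name are all chosen for the example and do not appear in the text. With lattice spacing h, the kernel A(x,x') corresponds to the matrix divided by h, and the inverse kernel to the inverse matrix divided by h, so that the discretized form of Eq. (3.20) reproduces a lattice delta function. For a long chain, the result may be compared with the continuum Green function exp(-kappa |x - x'|) / (2 kappa).

```python
import numpy as np

def lattice_green_function(length=20.0, num_sites=400, kappa=1.0):
    """Discretize A = -d^2/dx^2 + kappa^2 on a periodic chain and invert it."""
    h = length / num_sites
    eye = np.eye(num_sites)
    shift = np.roll(eye, 1, axis=1)                  # couples site i to site i + 1 (periodic)
    laplacian = (shift + shift.T - 2.0 * eye) / h**2
    A_matrix = -laplacian + kappa**2 * eye
    # Kernel A(x, x') <-> A_matrix / h, hence inverse kernel <-> inv(A_matrix) / h.
    green = np.linalg.inv(A_matrix) / h
    return h, A_matrix, green

if __name__ == "__main__":
    h, A_matrix, green = lattice_green_function()
    for site in (0, 20, 40):
        x = site * h
        print(f"x = {x:4.1f}   lattice: {green[0, site]:.5f}   continuum: {np.exp(-x) / 2:.5f}")
```

The same inverse matrix, read as a correlation function, gives the lattice version of the average of v(x)v(x') for Gaussian distributed functions.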



Before turning to specific applications of the Feynman path integral, let us stay with the 
general structure of the formalism and identify two fundamental connections of the path 
integral to classical point mechanics and classical and quantum statistical mechanics. 



Path integral and statistical mechanics 

The path integral reveals a connection between quantum mechanics and classical (and 
quantum) statistical mechanics whose importance to all areas of field theory and statis- 
tical physics can hardly be exaggerated. To reveal this link, let us for a moment forget 
about quantum mechanics and consider, by way of an example, a perfectly classical, one- 
dimensional continuum model describing a “flexible string.” We assume that our string is 
held under constant tension, and confined to a “gutter-like potential” (as shown in Fig. 3.2). 
For simplicity, we also assume that the mass density of the string is pretty high, so that 
its fluctuations are “asymptotically slow” (the kinetic contribution to its energy is negligi- 
ble). Transverse fluctuations of the string are then penalized by its line tension, and by the 
external potential. 

Assuming that the transverse displacement of the string u(x) is small, the potential energy 
stored in the string separates into two parts. The first arises from the line tension stored in 
the string, and the second comes from the external potential. Starting with the former, a 
transverse fluctuation of a line segment of length dx by an amount du leads to a potential 











Figure 3.2 A string held under tension and confined to a potential well V. 



energy of magnitude $\delta V_{\rm tension} = \sigma[(dx^2 + du^2)^{1/2} - dx] \simeq \sigma\,dx\,(\partial_x u)^2/2$, where $\sigma$ denotes the
tension. Integrating over the length of the string, one obtains $V_{\rm tension}[\partial_x u] \equiv \int \delta V_{\rm tension} =
\frac{1}{2}\int_0^L dx\, \sigma(\partial_x u(x))^2$. The second contribution, arising from the external potential, is given
by $V_{\rm external}[u] \equiv \int_0^L dx\, V(u(x))$. Adding the two contributions, we find that the total energy
of the string is given by $V = V_{\rm tension} + V_{\rm external} = \int_0^L dx\, [\frac{\sigma}{2}(\partial_x u)^2 + V(u)]$.

EXERCISE Find an expression for the kinetic energy contribution assuming that the string has 
a mass per unit length of m. How does this model compare to the continuum model of lattice 
vibrations discussed in Chapter 1? Convince yourself that in the limit $m \to \infty$, the kinetic
contribution to the partition function $Z = \mathrm{tr}\,[e^{-\beta H}]$ is inessential.



According to the general principles of statistical mechanics, the equilibrium properties of a
system are encoded in the partition function $Z = \mathrm{tr}\,[e^{-\beta V}]$, where “tr” denotes a summation
over all possible configurations of the system and $V$ is the total potential energy functional.
Applied to the present case, $\mathrm{tr} \to \int Du$, where $\int Du$ stands for the functional integration
over all configurations of the string $u(x)$, $x \in [0, L]$. Thus, the partition function of the
string is given by

$$ Z = \int Du\, \exp\left[-\beta\int_0^L dx\, \left(\frac{\sigma}{2}(\partial_x u)^2 + V(u)\right)\right]. $$ (3.22)



A comparison of this result with Eq. (3.8) shows that the partition function of the classical
system coincides with the quantum mechanical amplitude

$$ Z = \int dq\, \langle q|e^{-it\hat H/\hbar}|q\rangle \Big|_{\hbar = 1/\beta,\; t = -iL}, $$

evaluated at an imaginary “time” $t \to -i\tau \equiv -iL$, where $\hat H = \hat p^2/2\sigma + V(q)$, and Planck’s
constant is identified with the “temperature,” $\hbar = 1/\beta$. (Here we have assumed that our
string is subject to periodic boundary conditions.)

To see this explicitly, let us assume that we had reason to consider quantum propagation
in imaginary time, i.e. $e^{-it\hat H/\hbar} \to e^{-\tau\hat H/\hbar}$, or $t \to -i\tau$. Assuming convergence (i.e. positivity
of the eigenvalues of $\hat H$), a construction scheme perfectly analogous to the one outlined in
Section 3.1 would have led to a path integral formula of the structure (3.8). Formally, the
only differences would be (a) that the Lagrangian would be integrated along the imaginary
time axis $t' \to -i\tau' \in [0, -i\tau]$ and (b) that there would be a change of the sign of the kinetic








energy term, i.e. $(\partial_{t'}q)^2 \to -(\partial_{\tau'}q)^2$. After a suitable exchange of variables, $\tau \to L$, $\hbar \to 1/\beta$,
the coincidence of the resulting expression with the partition function (3.22) is clear.

The connection between quantum mechanics and classical statistical mechanics outlined 
above generalizes to higher dimensions. There are close analogies between quantum field 
theories in d dimensions and classical statistical mechanics in d + 1. (The equality of the 
path integral above with the one-dimensional statistical model is merely the d = 0 version 
of this connection.) In fact, this connection turned out to be one of the major driving forces 
behind the success of path integral techniques in modern field theory/statistical mechanics.
It offered, for the first time, a possibility to draw connections between systems that had 
seemed unrelated. 

However, the concept of imaginary times not only provides a bridge between quantum 
and classical statistical mechanics, but also plays a role within a purely quantum mechanical 
context. Consider the partition function of a single-particle quantum mechanical system, 



$$ Z = \mathrm{tr}\,[e^{-\beta\hat H}] = \int dq\, \langle q|e^{-\beta\hat H}|q\rangle. $$

The partition function can be interpreted as a trace over the transition amplitude
$\langle q|e^{-i\hat Ht/\hbar}|q\rangle$ evaluated at an imaginary time $t = -i\hbar\beta$. Thus, real time dynamics and
quantum statistical mechanics can be treated on the same footing, provided that we allow 
for the appearance of imaginary times. 
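
The imaginary time representation can be tested numerically on the harmonic oscillator. The sketch below is an illustration under assumptions not made in the text: units with hbar = m = 1, a symmetrized short-time kernel, a finite coordinate grid, and hypothetical function names. The interval [0, beta] is divided into slices, the coordinate integrals of the discretized imaginary time path integral with periodic boundary conditions are carried out as matrix products, and the trace is compared with the known oscillator result 1/(2 sinh(beta omega/2)).

```python
import numpy as np

def partition_function_path_integral(beta, num_slices=32, half_width=5.0,
                                      num_points=201, omega=1.0):
    """Z = tr exp(-beta H) from a time-sliced imaginary time path integral (hbar = m = 1)."""
    q = np.linspace(-half_width, half_width, num_points)
    dq = q[1] - q[0]
    dtau = beta / num_slices
    potential = 0.5 * omega**2 * q**2
    diff = q[:, None] - q[None, :]
    # Short-time kernel <q'|exp(-dtau H)|q>: free diffusion kernel times the
    # potential factor, the latter split symmetrically between both end points.
    kernel = np.sqrt(1.0 / (2.0 * np.pi * dtau)) * np.exp(
        -diff**2 / (2.0 * dtau) - 0.5 * dtau * (potential[:, None] + potential[None, :])
    )
    transfer = kernel * dq            # each intermediate integral becomes a matrix product
    return float(np.trace(np.linalg.matrix_power(transfer, num_slices)))

def partition_function_exact(beta, omega=1.0):
    """Harmonic oscillator partition function, sum over n of exp(-beta omega (n + 1/2))."""
    return 1.0 / (2.0 * np.sinh(0.5 * beta * omega))

if __name__ == "__main__":
    for beta in (0.5, 1.0, 2.0):
        print(beta, partition_function_path_integral(beta), partition_function_exact(beta))
```

Read with the replacements of the mass by the tension and of hbar by the inverse temperature of the string, the same sliced integral is a discretization of the classical partition function (3.22) for a parabolic gutter potential.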

Later we will see that the concept of imaginary or even generalized complex times plays an 
important role in all of field theory. There is even some nomenclature regarding imaginary
times. The transformation $t \to -i\tau$ is described as a Wick rotation (alluding to the
fact that a multiplication by the imaginary unit can be interpreted as a $(\pi/2)$-rotation
in the complex plane). Imaginary time representations of Lagrangian actions are termed 
Euclidean, whereas the real time forms are called Minkowski actions. 

INFO The origin of this terminology can be understood by considering the structure of the 
action of, say, the phonon model (1.4). Forgetting for a moment about the magnitude of the 
coupling constants, we see that the action has the bilinear structure $\sim x_\mu g^{\mu\nu} x_\nu$, where $\mu = 0, 1$,
the vector $x_\mu = \partial_\mu \phi$, and the diagonal matrix $g = \mathrm{diag}(-1, 1)$ is the two-dimensional version
of a Minkowski metric. (In three spatial dimensions, g would take the form of the standard 
Minkowski metric of special relativity.) On Wick rotation of the time variable, the factor —1 in 
the metric changes sign to +1, and g becomes a positive definite Euclidean metric. The nature 
of this transformation motivates the notation above. 

Once one has grown accustomed to the idea that the interpretation of time as an imaginary 
quantity can be useful, yet more general concepts can be conceived. For example, one 
may contemplate propagation along temporal contours that are neither purely real nor 
purely imaginary but rather are generally complex. Indeed, it has turned out that path 
integrals with curvilinear integration contours in the complex “time plane” find numerous 
applications in statistical and quantum field theory. 





Semiclassics from the path integral 



In deriving the two path integral representations (3.6) and (3.8) no approximations were 
made. Yet the vast majority of quantum mechanical problems are unsolvable in closed form, 
and it would be hoping for too much to expect that within the path integral approach this 
situation would be any different. In fact no more than the path integrals of problems with a 
quadratic Hamiltonian - corresponding to the quantum mechanical harmonic oscillator and 
generalizations thereof - can be carried out in closed form. Yet what counts more than the 
(rare) availability of exact solutions is the flexibility with which approximation schemes can 
be developed. As to the path integral formulation, it is particularly strong in cases where 
semiclassical limits of quantum theories are explored. Here, by “semiclassical,” we 
mean the limit $\hbar \to 0$, i.e. the case where the theory is expected to be largely governed by
classical structures with quantum fluctuations superimposed. 

To see more formally how classical structures enter the path integral approach, let us 
explore Eqs. (3.6) and (3.8) in the limit of small $\hbar$. In this case the path integrals are
dominated by path configurations with stationary action. (Non-stationary contributions to 
the integral imply massive phase fluctuations that largely average to zero.) Now, since the 
exponents of the two path integrals (3.6) and (3.8) involve the classical action functionals 
in their Hamiltonian and Lagrangian forms respectively, the extremal path configurations 
are simply the solutions of the classical equations of motion, namely, 

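$$\text{Hamiltonian:}\quad \delta S[x] = 0 \;\Rightarrow\; d_t x = \{H(x), x\} \equiv \partial_p H\, \partial_q x - \partial_q H\, \partial_p x,$$
$$\text{Lagrangian:}\quad \delta S[q] = 0 \;\Rightarrow\; (d_t \partial_{\dot{q}} - \partial_q) L(q, \dot{q}) = 0. \qquad (3.23)$$

These equations are to be solved subject to the boundary conditions $q(0) = q_i$ and $q(t) = q_f$.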
(Note that these boundary conditions do not uniquely specify a solution, i.e. in general there 
are many solutions to the equations (3.23). As an exercise, one may try to invent examples!) 

Now the very fact that the stationary phase configurations are classical does not imply 
that quantum mechanics has disappeared completely. As with saddle-point approximations 
in general, it is not just the saddle-point itself that matters but also the fluctuations around 
it. At least it is necessary to integrate out Gaussian (quadratic) fluctuations around the point 
of stationary phase. In the case of the path integral, fluctuations of the action around the 
stationary phase configurations involve non-classical (in that they do not solve the classical 
equations of motion) trajectories through phase or coordinate space. Before exploring how 
this mechanism works in detail, let us consider the stationary phase analysis of functional 
integrals in general. 

INFO Stationary phase approximation: Consider a general functional integral $\int Dx\, e^{-F[x]}$,
where $Dx = \lim_{N\to\infty} \prod_{n=1}^{N} dx_n$ represents a functional measure resulting from taking the continuum
limit of some finite-dimensional integration space, and the “action” $F[x]$ may be an
arbitrary complex functional of $x$ (leading to convergence of the integral). The function resulting
from taking the limit of infinitely many discretization points, $\{x_n\}$, is denoted by $x : t \mapsto x(t)$
(where $t$ plays the role of the formerly discrete index $n$). Evaluating the integral above within a
stationary phase approximation amounts to performing the following steps:




Figure 3.3 Quantum fluctuations around a classical path in coordinate space (here we assume a
set of two-dimensional coordinates). Non-classical paths $q$ fluctuating around a classical solution
$q_{\mathrm{cl}}$ typically extend a distance $O(\hbar^{1/2})$. All paths begin and end at $q_i$ and $q_f$, respectively.



1. Firstly, find the “points” of stationary phase, i.e. configurations $\bar{x}$ qualified by the condition
of vanishing functional derivative (cf. Section 1.2),

$$DF_{\bar{x}} = 0 \;\Leftrightarrow\; \forall t : \left.\frac{\delta F[x]}{\delta x(t)}\right|_{x=\bar{x}} = 0.$$

Although there may, in principle, be one or many solutions, for clarity we first discuss the
case in which the stationary phase configuration $\bar{x}$ is unique.

2. Secondly, Taylor expand the functional to second order around $\bar{x}$, i.e.

$$F[x] = F[\bar{x} + y] = F[\bar{x}] + \frac{1}{2} \int dt \int dt'\, y(t') A(t, t') y(t) + \cdots, \qquad (3.24)$$

where $A(t,t') = \left.\frac{\delta^2 F[x]}{\delta x(t)\, \delta x(t')}\right|_{x=\bar{x}}$ denotes the second functional derivative. Due to the stationarity
of $\bar{x}$, no first-order contribution can appear.

3. Thirdly, check that the operator $\hat{A} \equiv \{A(t, t')\}$ is positive definite. If it is not, there is a
problem - the integration over the Gaussian fluctuations $y$ diverges. (In practice, where the
analysis is rooted in a physical context, such eventualities arise only rarely. In situations
where problems do occur, the resolution can usually be found in a judicious rotation of the
integration contour.) For positive definite $\hat{A}$, however, the functional integral over $y$ can be
performed, after which one obtains $\int Dx\, e^{-F[x]} \simeq e^{-F[\bar{x}]} \det\left(\frac{\hat{A}}{2\pi}\right)^{-1/2}$ (cf. the discussion of
Gaussian integrals above and, in particular, Eq. (3.19)).

4. Finally, if there are many stationary phase configurations, $\bar{x}_i$, the individual contributions
have to be added:

$$\int Dx\, e^{-F[x]} \simeq \sum_i e^{-F[\bar{x}_i]} \det\left(\frac{\hat{A}_i}{2\pi}\right)^{-1/2}. \qquad (3.25)$$

Equation (3.25) represents the most general form of the stationary phase evaluation of a (real)
functional integral.

EXERCISE Applied to the Gamma function, $\Gamma(z + 1) = \int_0^\infty dx\, x^z e^{-x}$, with $z$ complex, show
that the stationary phase approximation is consistent with Stirling’s approximation, i.e.
$\Gamma(s + 1) \simeq \sqrt{2\pi s}\, e^{s(\ln s - 1)}$.
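The steps of the stationary phase program can be checked numerically on this one-dimensional example. The following is a minimal sketch, not a substitute for the derivation: it assumes real $s > 0$ and writes the integrand as $e^{-F(x)}$ with $F(x) = x - s \ln x$, so that the stationary point is $\bar{x} = s$ and the second derivative there is $F''(\bar{x}) = 1/s$. The function name is a hypothetical choice made for illustration.

```python
import math


def stirling_from_saddle(s: float) -> float:
    """Gaussian (stationary phase) estimate of Gamma(s + 1) for real s > 0.

    Assumption: the integrand x**s * exp(-x) is written as exp(-F(x))
    with F(x) = x - s * log(x).
    """
    x_bar = s                                # step 1: F'(x) = 1 - s/x = 0
    f_bar = x_bar - s * math.log(x_bar)      # value of F at the saddle
    curvature = s / x_bar**2                 # step 2: F''(x_bar) = 1/s
    # step 3: one-dimensional analogue of det(A / 2 pi)^(-1/2)
    return math.exp(-f_bar) * math.sqrt(2.0 * math.pi / curvature)


if __name__ == "__main__":
    for s in (5.0, 20.0, 80.0):
        ratio = stirling_from_saddle(s) / math.gamma(s + 1.0)
        print(f"s = {s:5.1f}   saddle/exact = {ratio:.6f}")
```

The ratio approaches unity as $s$ grows, with a leading relative deviation of order $1/(12s)$, in line with the expectation that the Gaussian approximation improves when the exponent becomes large.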






Applied to the Lagrangian form of the Feynman path integral, this program can be implemented
directly. In this case, the extremal field configuration $\bar{q}(t)$ is identified as the classical
solution associated with the Lagrangian, i.e. $\bar{q}(t) \equiv q_{\mathrm{cl}}(t)$. Defining $r(t) = q(t) - q_{\mathrm{cl}}(t)$ as
the deviation of a general path, $q(t)$, from a nearby classical path, $q_{\mathrm{cl}}(t)$ (see Fig. 3.3), and
assuming for simplicity that there exists only one classical solution connecting $q_i$ with $q_f$ in
a time $t$, a stationary phase analysis obtains

$$\langle q_f | e^{-i\hat{H}t/\hbar} | q_i \rangle \simeq e^{iS[q_{\mathrm{cl}}]/\hbar} \int_{r(0)=r(t)=0} Dr\, \exp\left[\frac{i}{2\hbar} \int_0^t dt' \int_0^t dt''\, r(t') \left.\frac{\delta^2 S[q]}{\delta q(t')\, \delta q(t'')}\right|_{q=q_{\mathrm{cl}}} r(t'')\right] \qquad (3.26)$$

as the Gaussian approximation to the path integral (cf. Eq. (3.24)). For Lagrangians
of the form $L(q, \dot{q}) = m\dot{q}^2/2 - V(q)$, the second functional derivative of the action can be
straightforwardly computed by means of the rules of functional differentiation formulated
in Chapter 1. Alternatively, one can obtain this result by simply expanding the action as a
Taylor series in the deviation $r(t)$. As a result, one obtains (exercise)




$$\frac{1}{2} \int_0^t dt \int_0^t dt'\, r(t) \left.\frac{\delta^2 S[q]}{\delta q(t)\, \delta q(t')}\right|_{q=q_{\mathrm{cl}}} r(t') = -\frac{1}{2} \int dt\, r(t) \left[m\partial_t^2 + V''(q_{\mathrm{cl}}(t))\right] r(t), \qquad (3.27)$$

where $V''(q_{\mathrm{cl}}(t)) \equiv \partial_q^2 V(q)|_{q=q_{\mathrm{cl}}}$ represents an ordinary derivative of the potential function.
Thus, the Gaussian integration over $r$ yields the square root of the determinant of the
operator $m\partial_t^2 + V''(q_{\mathrm{cl}}(t))$ - interpreted as an operator acting in the space of functions $r(t)$
with boundary conditions $r(0) = r(t) = 0$. (Note that, as we are dealing with a differential
operator, the issue of boundary conditions is crucial.)



INFO More generally, Gaussian integration over fluctuations around the stationary phase configuration
obtains the formal expression

$$\langle q_f | e^{-i\hat{H}t/\hbar} | q_i \rangle \simeq \det\left(\frac{i}{2\pi\hbar} \frac{\partial^2 S[q_{\mathrm{cl}}]}{\partial q_i\, \partial q_f}\right)^{1/2} e^{\frac{i}{\hbar} S[q_{\mathrm{cl}}]} \qquad (3.28)$$

as the final result for the transition amplitude evaluated in the semiclassical approximation.
(In cases where there is more than one classical solution, the individual contributions have
to be added.) To derive this expression, one shows that the operator controlling the quadratic
action (3.27) fulfils some differential relations which can again be related back to the classical
action. While a detailed formulation of this calculation⁹ is beyond the scope of the present text,
the heuristic interpretation of the result is straightforward, as detailed below.

According to the rules of quantum mechanics $P(q_f, q_i, t) = |\langle q_f | e^{-i\hat{H}t/\hbar} | q_i \rangle|^2$ defines the probability
density function for a particle injected at coordinate $q_i$ to arrive at coordinate $q_f$ after
a time $t$. In the semiclassical approximation, the probability density function assumes the form
$P(q_f, q_i, t) = \left|\det\left(\frac{1}{2\pi\hbar} \frac{\partial^2 S[q_{\mathrm{cl}}]}{\partial q_i\, \partial q_f}\right)\right|$. We can gain some physical insight into this expression from the
following consideration: for a fixed initial coordinate $q_i$, the final coordinate $q_f(q_i, p_i)$ becomes
a function of the initial momentum $p_i$. The classical probability density function $P(q_i, q_f)$ can
then be related to the probability density function $P(q_i, p_i)$ for a particle to leave from the initial
phase space coordinate $(q_i, p_i)$ according to

$$P(q_i, q_f)\, dq_i\, dq_f = P(q_i, q_f(q_i, p_i)) \left|\det\left(\frac{\partial q_f}{\partial p_i}\right)\right| dq_i\, dp_i = P(q_i, p_i)\, dq_i\, dp_i.$$

Now, if we say that our particle actually left at the phase space coordinate $(q_i, p_i)$, $P$ must be
singular at $(q_i, p_i)$ while being zero everywhere else. In quantum mechanics, however, all we
can say is that our particle was initially confined to a Planck cell centered around $(q_i, p_i)$:
$P(q_i, p_i) = 1/(2\pi\hbar)^d$. We thus conclude that $P(q_i, q_f) = |\det(\partial p_i/\partial q_f)|\,(2\pi\hbar)^{-d}$. Finally, noticing
that $p_i = -\partial_{q_i} S$ we arrive at the result of the semiclassical analysis above.

9 See, e.g., Schulman 1 .

In deriving Eq. (3.28) we have restricted ourselves to the consideration of quadratic fluctuations
around the classical paths. Under what conditions is this semiclassical approximation
justified? Unfortunately there is no rigorous and generally applicable answer to this question. For
finite $\hbar$, the quality of the approximation depends largely on the sensitivity of the action to path
variations. Whether or not the approximation is legitimate is a question that has to be judged
from case to case. However, the asymptotic stability of the semiclassical approximation in the
limit $\hbar \to 0$ can be deduced simply from power counting. From the structure of Eq. (3.28) it
is clear that the typical magnitude of fluctuations $r(t)$ scales as $r \sim (\hbar/\delta_q^2 S)^{1/2}$, where $\delta_q^2 S$ is
a symbolic shorthand for the functional variation of the action. (Variations larger than that
lead to phase fluctuations $> 2\pi$, thereby being negligible.) Non-Gaussian contributions to the
action would have the structure $\sim \hbar^{-1} r^n \delta_q^n S$, $n > 2$. For a typical $r$, this is of the order
$\sim \delta_q^n S/(\delta_q^2 S)^{n/2} \times \hbar^{n/2-1}$. Since the $S$-dependent factors are classical ($\hbar$-independent), these
contributions scale to zero as $\hbar \to 0$.
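The power counting argument can be illustrated on an ordinary integral. The sketch below is a toy model chosen for illustration and is not taken from the text: it assumes a real "action" $S(x) = x^2/2 + x^4/4$, evaluates $\int dx\, e^{-S(x)/\hbar}$ on a grid, and compares with the Gaussian result $\sqrt{2\pi\hbar}$. The quartic term corresponds to $n = 4$, so the relative correction should scale as $\hbar^{n/2-1} = \hbar$; a first-order expansion of the quartic term gives $-3\hbar/4$.

```python
import math

import numpy as np


def anharmonic_ratio(hbar: float) -> float:
    """Ratio of the full integral of exp(-S/hbar) to its Gaussian approximation.

    Assumption (toy model): S(x) = x**2 / 2 + x**4 / 4, with saddle at x = 0.
    """
    width = math.sqrt(hbar)                       # typical fluctuation ~ hbar**0.5
    x = np.linspace(-12.0 * width, 12.0 * width, 20001)
    dx = x[1] - x[0]
    # Riemann sum; the integrand is negligible at the ends of the grid.
    full = np.sum(np.exp(-(0.5 * x**2 + 0.25 * x**4) / hbar)) * dx
    gaussian = math.sqrt(2.0 * math.pi * hbar)    # keeps only the quadratic term
    return float(full / gaussian)


if __name__ == "__main__":
    for hbar in (0.1, 0.01, 0.001):
        print(f"hbar = {hbar:6.3f}   full/Gaussian = {anharmonic_ratio(hbar):.6f}"
              f"   1 - 3*hbar/4 = {1.0 - 0.75 * hbar:.6f}")
```

The deviation from unity shrinks linearly in $\hbar$, as the power counting suggests for a quartic non-Gaussian contribution.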




This concludes the conceptual part of the chapter. Before turning to the discussion of 
applications of the path integral, let us first briefly summarize the main steps taken in its 
construction. 



Construction recipe of the path integral 

Consider a general quantum transition amplitude $\langle \psi | e^{-i\hat{H}t/\hbar} | \psi' \rangle$, where $t$ may be real,
purely imaginary, or generally complex. To construct a functional integral representation of
the amplitude:

1. Partition the time interval into $N \gg 1$ steps,

$$e^{-i\hat{H}t/\hbar} = \left[e^{-i\hat{H}\Delta t/\hbar}\right]^N, \qquad \Delta t = t/N.$$






2. Regroup the operator content appearing in the expansion of each factor $e^{-i\hat{H}\Delta t/\hbar}$ according
to the relation

$$e^{-i\hat{H}\Delta t/\hbar} = 1 + \Delta t \sum_{mn} c_{mn} \hat{A}^m \hat{B}^n + O(\Delta t^2),$$

where the eigenstates $|a\rangle$, $|b\rangle$ of $\hat{A}$, $\hat{B}$ are known and the coefficients $c_{mn}$ are c-numbers.
(In the quantum mechanical application above $\hat{A} = \hat{p}$, $\hat{B} = \hat{q}$.) This “normal ordering”
procedure emphasizes that distinct quantum mechanical systems may be associated with 
the same classical action. 

3. Insert resolutions of identity according to 

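$$e^{-i\hat{H}\Delta t/\hbar} = \sum_{a,b} |a\rangle\langle a| \left(1 + \Delta t \sum_{mn} c_{mn} \hat{A}^m \hat{B}^n + O(\Delta t^2)\right) |b\rangle\langle b|$$
$$= \sum_{a,b} |a\rangle\langle a|\, e^{-iH(a,b)\Delta t/\hbar}\, |b\rangle\langle b| + O(\Delta t^2),$$

where $H(a, b)$ is the Hamiltonian evaluated at the eigenvalues of $\hat{A}$ and $\hat{B}$.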

4. Regroup terms in the exponent: due to the “mismatch” of the eigenstates at neighboring 
time slices $n$ and $n+1$, not only the Hamiltonians $H(a, b)$, but also sums over differences
of eigenvalues, appear (cf. the last term in the action (3.5)). 

5. Take the continuum limit. 
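The time-slicing at the heart of this recipe can be tested numerically. The following sketch makes several assumptions that are not part of the text: it works in imaginary time (so that all exponentials are real), sets $m = \hbar = \omega = 1$, takes a harmonic potential, and represents $\hat{p}^2/2$ by a finite-difference matrix on a coordinate grid. All function names are hypothetical. It compares the exact $e^{-\tau(\hat{T}+\hat{V})}$ with the product of $N$ factorized slices $e^{-\Delta\tau \hat{T}} e^{-\Delta\tau \hat{V}}$, each of which is correct up to $O(\Delta\tau^2)$.

```python
import numpy as np


def exp_sym(a: np.ndarray) -> np.ndarray:
    """Matrix exponential of a real symmetric matrix via its eigen-decomposition."""
    w, v = np.linalg.eigh(a)
    return (v * np.exp(w)) @ v.T


def build_hamiltonian(n_grid: int = 64, length: float = 10.0):
    """Return (T, V) on a grid; assumes m = hbar = omega = 1 and V(q) = q**2 / 2."""
    q = np.linspace(-length / 2.0, length / 2.0, n_grid)
    dq = q[1] - q[0]
    lap = (np.diag(-2.0 * np.ones(n_grid))
           + np.diag(np.ones(n_grid - 1), 1)
           + np.diag(np.ones(n_grid - 1), -1)) / dq**2
    kinetic = -0.5 * lap                 # finite-difference version of p^2 / 2
    potential = np.diag(0.5 * q**2)
    return kinetic, potential


def trotter_error(tau: float, n_steps: int) -> float:
    """Relative Frobenius-norm error of the N-slice product versus exp(-tau * H)."""
    kinetic, potential = build_hamiltonian()
    exact = exp_sym(-tau * (kinetic + potential))
    d_tau = tau / n_steps
    one_slice = exp_sym(-d_tau * kinetic) @ exp_sym(-d_tau * potential)
    sliced = np.linalg.matrix_power(one_slice, n_steps)
    return float(np.linalg.norm(sliced - exact) / np.linalg.norm(exact))


if __name__ == "__main__":
    for n in (10, 20, 40, 80, 160):
        print(f"N = {n:4d}   relative error = {trotter_error(1.0, n):.3e}")
```

Since the per-slice error is $O(\Delta\tau^2)$ and there are $N = \tau/\Delta\tau$ slices, the accumulated error is expected to decrease roughly in proportion to $1/N$, which is what justifies taking the continuum limit in the final step.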

3.3 Applications of the Feynman path integral 

Having introduced the general machinery of path integration we now turn to the discussion 
of specific applications. Our starting point will be an investigation of a low-energy quantum 
particle confined to a single potential well, and the phenomenon of tunneling in a double 
well. With the latter, we become acquainted with instanton techniques and the role of 
topology in field theory. The ideas developed in this section are generalized further in the 
investigation of quantum mechanical decay and quantum dissipation. Finally, we turn our 
attention to the development of the path integral for quantum mechanical spin and, as a 
case study, explore the semiclassical trace formulae for quantum chaos.

The simplest example of a quantum mechanical problem is that of a free particle
($\hat{H} = \hat{p}^2/2m$). Yet, within the framework of the path integral, this example, which can be
dealt with straightforwardly by elementary means, is far from trivial: the Gaussian functional
integral engaged in its construction involves divergences which must be regularized
by rediscretizing the path integral. Nevertheless, its knowledge will be useful as a means to
normalize the path integral in the applications below. Therefore, we leave it as an exercise
to show¹⁰

$$G_{\mathrm{free}}(q_f, q_i; t) \equiv \langle q_f | e^{-\frac{i}{\hbar} \frac{\hat{p}^2}{2m} t} | q_i \rangle\, \Theta(t) = \left(\frac{m}{2\pi i \hbar t}\right)^{1/2} e^{\frac{i}{\hbar} \frac{m}{2t} (q_f - q_i)^2}\, \Theta(t), \qquad (3.29)$$

where the Heaviside $\Theta$-function reflects causality.¹¹

10 Compare this result with the solution of a classical diffusion equation.

EXERCISE Derive Eq. (3.29) by the standard methodology of quantum mechanics. (Hint: Insert 
a resolution of identity and perform a Gaussian integral.) 
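A quick numerical consistency check of the free propagator can be made in imaginary time, where the comparison with diffusion becomes explicit. The sketch below assumes the standard free-particle result continued to $t \to -i\tau$, i.e. the kernel $(m/2\pi\hbar\tau)^{1/2} \exp[-m(q_f - q_i)^2/2\hbar\tau]$, which is the heat kernel of a diffusion equation with diffusion constant $\hbar/2m$. It verifies the composition property that underlies time-slicing: integrating over an intermediate coordinate reproduces the kernel for the total time. Function names and parameter values are hypothetical.

```python
import numpy as np


def g_free_euclid(qf, qi, tau, m=1.0, hbar=1.0):
    """Imaginary-time free-particle kernel (assumed form; a diffusion kernel)."""
    return np.sqrt(m / (2.0 * np.pi * hbar * tau)) * np.exp(
        -m * (qf - qi) ** 2 / (2.0 * hbar * tau))


def compose(qf, qi, tau1, tau2, n_points=4001, q_max=30.0):
    """Integrate G(qf, q; tau1) * G(q, qi; tau2) over the intermediate point q."""
    q = np.linspace(-q_max, q_max, n_points)
    dq = q[1] - q[0]
    integrand = g_free_euclid(qf, q, tau1) * g_free_euclid(q, qi, tau2)
    # Riemann sum; the Gaussian integrand is negligible at the grid ends.
    return float(np.sum(integrand) * dq)


if __name__ == "__main__":
    lhs = compose(0.7, -0.3, 0.4, 0.9)
    rhs = float(g_free_euclid(0.7, -0.3, 0.4 + 0.9))
    print(f"composed kernel = {lhs:.10f}")
    print(f"direct kernel   = {rhs:.10f}")
```

The agreement of the two numbers reflects the semigroup property $e^{-\tau_1 \hat{H}} e^{-\tau_2 \hat{H}} = e^{-(\tau_1+\tau_2)\hat{H}}$ after insertion of a resolution of identity in the coordinate basis.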

EXERCISE Using the path integral, obtain a perturbative expansion for the scattering amplitude
$\langle p' | U(t \to \infty, t' \to -\infty) | p \rangle$ of a free particle from a short-range central potential $V(r)$. In
particular, show that the first-order term in the expansion recovers the Born scattering amplitude
$-i\hbar\, e^{-i(t-t')E(p)/\hbar}\, \delta(E(p) - E(p'))\, \langle p' | V | p \rangle$.



Quantum particle in a well 

As a first application of the path integral, let us consider the problem of a quantum particle 
in a one-dimensional potential well (see figure). The discussion of this example illustrates 
how the semiclassical evaluation scheme discussed above works in practice. For simplicity 
we assume the potential to be symmetric, $V(q) = V(-q)$ with $V(0) = 0$. The quantity we
wish to compute is the probability amplitude that a particle injected at $q = 0$ returns after
a time $t$, i.e. with $\hat{H} = \hat{p}^2/2m + V(\hat{q})$, $G(0, 0; t) \equiv \langle q_f = 0 | e^{-i\hat{H}t/\hbar} | q_i = 0 \rangle\, \Theta(t)$. Drawing
on our previous discussion, the path integral representation of the transition amplitude is
given by

$$G(0, 0; t) = \int_{q(t)=q(0)=0} Dq\, \exp\left(\frac{i}{\hbar} \int_0^t dt'\, L(q, \dot{q})\right),$$

where $L = m\dot{q}^2/2 - V(q)$ represents the corresponding Lagrangian.

Now, for a generic potential $V(q)$, the path integral cannot be evaluated exactly. Instead, we wish
to invoke the semiclassical analysis outlined above. Accordingly, we must first find solutions to the
classical equation of motion. Minimizing the action with respect to variations of $q(t)$, one obtains
the Euler-Lagrange equation of motion $m\ddot{q} = -V'(q)$. According to the Feynman path integral,
this equation must be solved subject to the boundary conditions $q(t) = q(0) = 0$. One solution is obvious,
namely $q_{\mathrm{cl}}(t) = 0$. Assuming that this is in fact the only solution,¹² we obtain (cf. Eqs. (3.26)
and (3.27))

$$G(0, 0; t) \simeq \int_{r(0)=r(t)=0} Dr\, \exp\left(-\frac{i}{\hbar} \int_0^t dt'\, r(t')\, \frac{m}{2} \left(\partial_{t'}^2 + \omega^2\right) r(t')\right),$$

where, by definition, $m\omega^2 \equiv V''(0)$ is the second derivative of the potential at the origin.¹³
Note that, in this case, the contribution to the action from the stationary phase field configuration
vanishes: $S[q_{\mathrm{cl}}] = 0$. Following the discussion of Section 3.2, Gaussian functional
integration over $r$ then leads to the semiclassical expansion

$$G(0, 0; t) \simeq J \det\left(-m(\partial_t^2 + \omega^2)/2\right)^{-1/2}, \qquad (3.30)$$

where the prefactor $J$ absorbs various constant prefactors.

11 Motivated by its interpretation as a Green function, in the following we refer to the quantum transition probability
amplitude by the symbol $G$ (as opposed to $U$ used above).

12 In general, this assumption is wrong. For smooth potentials $V(q)$, a Taylor expansion of $V$ at small $q$ obtains
the harmonic oscillator potential, $V(q) = V_0 + m\omega^2 q^2/2 + \cdots$. For times $t$ that are commensurate with $\pi/\omega$,
one has periodic solutions, $q_{\mathrm{cl}}(t) \propto \sin(\omega t)$ that start out from the origin at time $t = 0$ and revisit it at just
the right time $t$. In the next section we will see why the restriction to just the trivial solution was nonetheless
legitimate (for arbitrary times $t$).

Operator determinants are usually most conveniently obtained by presenting them as a
product over eigenvalues. In the present case, the eigenvalues $\epsilon_n$ are determined by the
equation

$$-\frac{m}{2} \left(\partial_t^2 + \omega^2\right) r_n = \epsilon_n r_n,$$

which is to be solved subject to the boundary condition $r_n(t) = r_n(0) = 0$. A complete set of
solutions to this equation is given by¹⁴ $r_n(t') = \sin(n\pi t'/t)$, $n = 1, 2, \ldots$, with eigenvalues
$\epsilon_n = m[(n\pi/t)^2 - \omega^2]/2$. Applying this to the determinant, one finds

$$\det\left(-m(\partial_t^2 + \omega^2)/2\right)^{-1/2} = \prod_{n=1}^{\infty} \left(\frac{m}{2} \left(\left(\frac{n\pi}{t}\right)^2 - \omega^2\right)\right)^{-1/2}.$$
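The eigenvalue spectrum quoted above can be cross-checked by discretizing the fluctuation operator. The sketch below is an illustration under stated assumptions: the interval $[0, t]$ is divided into a uniform grid, $\partial_t^2$ is replaced by the standard three-point finite-difference matrix acting on the interior points (which implements the boundary condition $r(0) = r(t) = 0$), and the parameter values are arbitrary. The function names are hypothetical.

```python
import numpy as np


def fluctuation_eigenvalues(t, omega, m=1.0, n_grid=400):
    """Eigenvalues of -(m/2)(d^2/dt^2 + omega^2) with Dirichlet boundary conditions.

    Assumption: uniform grid with n_grid intervals, three-point second difference.
    """
    dt = t / n_grid
    size = n_grid - 1                      # interior points only: r(0) = r(t) = 0
    d2 = (np.diag(-2.0 * np.ones(size))
          + np.diag(np.ones(size - 1), 1)
          + np.diag(np.ones(size - 1), -1)) / dt**2
    operator = -0.5 * m * (d2 + omega**2 * np.eye(size))
    return np.linalg.eigvalsh(operator)    # returned in ascending order


def analytic_eigenvalues(t, omega, n_max, m=1.0):
    """epsilon_n = m[(n pi / t)^2 - omega^2] / 2 for n = 1, ..., n_max."""
    n = np.arange(1, n_max + 1)
    return 0.5 * m * ((n * np.pi / t) ** 2 - omega**2)


if __name__ == "__main__":
    numeric = fluctuation_eigenvalues(t=1.0, omega=1.0)[:5]
    exact = analytic_eigenvalues(t=1.0, omega=1.0, n_max=5)
    for k, (a, b) in enumerate(zip(numeric, exact), start=1):
        print(f"n = {k}   grid: {a:12.6f}   analytic: {b:12.6f}")
```

The low-lying eigenvalues of the discretized operator agree with $\epsilon_n$ up to corrections that vanish with the grid spacing; only the large-$n$ end of the spectrum is sensitive to the discretization.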




To interpret this result, one must make sense of the infinite product (which even seems 
divergent for times commensurate with $\pi/\omega$). Moreover the value of the constant $J$ has
yet to be determined. To resolve these difficulties, one may exploit the facts that (a) we
do know the transition amplitude Eq. (3.29) of the free particle system, and (b) the latter
coincides with the transition amplitude $G$ in the special case where the potential $V = 0$.
In other words, had we computed $G_{\mathrm{free}}$ via the path integral, we would have obtained the
same constant $J$ and, more importantly, an infinite product like the one above, but with
$\omega = 0$. This allows the transition amplitude to be regularized as



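$$G(0, 0; t) \equiv \frac{G(0, 0; t)}{G_{\mathrm{free}}(0, 0; t)}\, G_{\mathrm{free}}(0, 0; t) = \prod_{n=1}^{\infty} \left[1 - \left(\frac{\omega t}{n\pi}\right)^2\right]^{-1/2} \left(\frac{m}{2\pi i \hbar t}\right)^{1/2} \Theta(t).$$

Then, with the help of the identity $\prod_{n=1}^{\infty} [1 - (x/n\pi)^2]^{-1} = x/\sin x$, one finally arrives at
the result

$$G(0, 0; t) \simeq \sqrt{\frac{m\omega}{2\pi i \hbar \sin(\omega t)}}\, \Theta(t). \qquad (3.31)$$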
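The regularization by the free-particle result can be made concrete numerically. The sketch below, which rests on assumptions not contained in the text (a uniform time grid, three-point finite differences, arbitrary parameter values, hypothetical function names), evaluates the ratio of the discretized determinants with and without the $\omega^2$ term, and separately a truncated version of the infinite product. Both should approach $\sin(\omega t)/\omega t$, whose inverse square root is the factor multiplying $G_{\mathrm{free}}$.

```python
import math

import numpy as np


def det_ratio_discretized(omega, t, n_grid=1000):
    """det(-d^2 - omega^2) / det(-d^2) on a Dirichlet grid (illustrative sketch).

    Both matrices are multiplied by dt**2, which cancels in the ratio.
    """
    dt = t / n_grid
    size = n_grid - 1

    def sign_and_logdet(shift):
        mat = (np.diag((2.0 - shift) * np.ones(size))
               - np.diag(np.ones(size - 1), 1)
               - np.diag(np.ones(size - 1), -1))
        return np.linalg.slogdet(mat)

    sign_w, log_w = sign_and_logdet((omega * dt) ** 2)
    sign_0, log_0 = sign_and_logdet(0.0)
    return float(sign_w * sign_0 * math.exp(log_w - log_0))


def product_truncated(x, n_max=100000):
    """Truncation of prod_{n>=1} [1 - (x / (n pi))^2], which tends to sin(x)/x."""
    n = np.arange(1, n_max + 1)
    return float(np.prod(1.0 - (x / (n * np.pi)) ** 2))


if __name__ == "__main__":
    omega, t = 2.0, 1.0
    print("discretized determinant ratio:", det_ratio_discretized(omega, t))
    print("truncated infinite product   :", product_truncated(omega * t))
    print("sin(omega t) / (omega t)     :", math.sin(omega * t) / (omega * t))
```

Each determinant taken separately grows without bound as the grid is refined; only the ratio has a finite limit, which is the numerical counterpart of normalizing against $G_{\mathrm{free}}$.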



13 Those who are uncomfortable with functional differentiation can arrive at the same expression simply by substituting
$q(t) = q_{\mathrm{cl}}(t) + r(t)$ into the action and expanding in $r$.

14 To find the solutions of this equation, recall the structure of the Schrödinger equation of a particle in a one-dimensional
box of width $L = t$.






In the case of the harmonic oscillator, the expansion of the potential necessarily truncates 
at quadratic order and, in this case, the expression above is exact. (For a more wide- 
ranging discussion of the path integral for the quantum harmonic oscillator system, see 
Problem 3.5.) For a general potential, the semiclassical approximation effectively involves 
the replacement of V(q) by a quadratic potential with the same curvature. The calculation 
above also illustrates how coordinate space fluctuations around a completely static solution 
may reinstate the zero-point fluctuations characteristic of quantum mechanical bound states. 



Double well potential: tunneling and instantons 

As a second application of the path integral let us now consider the motion of a particle in a
double well potential (see figure). Our aim will be to estimate the quantum probability amplitude for a
particle either to stay at the bottom of one of the local minima or to go from one minimum to the
other. In doing so, it is understood that the energy range accessible to the particle (i.e. $\Delta E \sim \hbar/t$) is
well below the potential barrier height, i.e. quantum mechanical transfer between minima is by tunneling.
Here, in contrast to the single well system, it is far from clear what kind of classical stationary
phase solutions may serve as a basis for a description of the quantum dynamics; there appear
to be no classical paths connecting the two minima. Of course one may think of particles
“rolling” over the potential hill. Yet, these are singular and, by assumption, energetically
inaccessible.

The key to resolving these difficulties is an observation, already made above, that the 
time argument appearing in the path integral should be considered as a general complex 
quantity that can (according to convenience) be set to any value in the complex plane. In 
the present case, a Wick rotation to imaginary times will reveal a stationary point of the 
action. At the end of the calculation, the real time amplitudes we seek can be obtained by 
analytic continuation. 

INFO The mechanism of quantum double (or multiple) well tunneling plays a role in a number 
of problems of condensed matter physics. A prominent example is in the physics of amorphous 
solids such as glasses. 

A caricature of a glass is shown in the figure. The 
absence of long-range order in the system implies that 
individual chemical bonds cannot assume their optimal 
binding lengths. For understretched bonds this leads to 
the formation of two approximately equal metastable min- 
ima around the ideal binding axis (see the inset). The 
energetically lowest excitations of the system are transitions of individual atoms between
nearly degenerate minima of this type, i.e. flips of atoms around
the binding axis. A prominent phenomenological model¹⁵ describes the system by an ensemble
of quantum double wells of random center height and width. This model effortlessly explains the 
existence of a vast system of metastable points in the landscape of low-energy configurations of 
glassy systems. 



To be specific, let us consider the imaginary time transition amplitudes

$$G_E(a, \pm a; \tau) \equiv \langle a | \exp\left(-\frac{\tau}{\hbar} \hat{H}\right) | \pm a \rangle = G_E(-a, \mp a; \tau), \qquad (3.32)$$

where the coordinates $\pm a$ coincide with the two minima of the potential. From Eq. (3.32)
the real time amplitudes $G(a, \pm a; t) = G_E(a, \pm a; \tau \to it)$ can be recovered by the analytic
continuation $\tau \to it$. According to Section 3.2, the Euclidean path integral formulation
of the transition amplitudes is given by

$$G_E(a, \pm a; \tau) = \int_{q(0)=\pm a,\, q(\tau)=a} Dq\, \exp\left(-\frac{1}{\hbar} \int_0^\tau d\tau' \left(\frac{m}{2} \dot{q}^2 + V(q)\right)\right), \qquad (3.33)$$

where the function $q$ now depends on imaginary time. From Eq. (3.33) we obtain the stationary
phase (or saddle-point) equations

$$-m\ddot{q} + V'(q) = 0. \qquad (3.34)$$

From this result, one can infer that, as a consequence of the Wick rotation, there is an
effective inversion of the potential, $V \to -V$ (shown dashed in the figure on page 115). The
crucial point is that, within the inverted potential landscape, the barrier has become a sink,
i.e. within the new formulation, there are classical solutions connecting the two points, $\pm a$.
More precisely, there are three different types of classical solution that fulfill the condition
to be at coordinates $\pm a$ at times 0 and/or $\tau$: (a) the solution wherein the particle rests
permanently at $a$;¹⁶ (b) the corresponding solution staying at $-a$; and, most importantly,
(c) the solution in which the particle leaves its initial position at $\pm a$, accelerates through
the minimum at 0 and eventually reaches the final position $\mp a$ at time $\tau$. In computing
the transition amplitudes, all three types of path have to be taken into account. As to
(a) and (b), by computing quantum fluctuations around these solutions, one can recover
the physics of the zero-point motion described in Section 3.3 for each well individually.
(Exercise: Convince yourself that this is true!) Now let us see what happens if the paths
connecting the two coordinates are added to this picture.
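To make solutions of type (c) tangible, one may look at a specific example. The sketch below is an illustration only and assumes a quartic double well, $V(q) = \frac{m\omega^2}{8a^2}(q^2 - a^2)^2$, which is not specified in the text above; for this choice $V''(\pm a) = m\omega^2$, and the profile $q(\tau) = a \tanh(\omega\tau/2)$ is a candidate path interpolating between $-a$ and $a$. For convenience the path is centered at $\tau = 0$ on a long symmetric interval rather than on $[0, \tau]$. The code checks numerically that this profile satisfies the saddle-point equation (3.34) and evaluates its Euclidean action, which for this potential is $\frac{2}{3} m\omega a^2$. All function names are hypothetical.

```python
import numpy as np


def potential(q, a=1.0, omega=1.0, m=1.0):
    """Assumed quartic double well with minima at q = +a and q = -a."""
    return m * omega**2 / (8.0 * a**2) * (q**2 - a**2) ** 2


def potential_prime(q, a=1.0, omega=1.0, m=1.0):
    """Derivative V'(q) of the assumed potential."""
    return m * omega**2 / (2.0 * a**2) * q * (q**2 - a**2)


def kink(tau, a=1.0, omega=1.0):
    """Candidate path running from -a (tau -> -inf) to +a (tau -> +inf)."""
    return a * np.tanh(0.5 * omega * tau)


def saddle_point_residual(a=1.0, omega=1.0, m=1.0, n_points=4001, tau_max=20.0):
    """Largest value of |-m q'' + V'(q)| along the path, by finite differences."""
    tau = np.linspace(-tau_max, tau_max, n_points)
    h = tau[1] - tau[0]
    q = kink(tau, a, omega)
    q_ddot = (q[2:] - 2.0 * q[1:-1] + q[:-2]) / h**2
    residual = -m * q_ddot + potential_prime(q[1:-1], a, omega, m)
    return float(np.max(np.abs(residual)))


def euclidean_action(a=1.0, omega=1.0, m=1.0, n_points=4001, tau_max=20.0):
    """Riemann-sum estimate of the integral of m q'^2 / 2 + V(q) along the path."""
    tau = np.linspace(-tau_max, tau_max, n_points)
    h = tau[1] - tau[0]
    q = kink(tau, a, omega)
    q_dot = np.gradient(q, tau)
    return float(np.sum(0.5 * m * q_dot**2 + potential(q, a, omega, m)) * h)


if __name__ == "__main__":
    print("max residual of Eq. (3.34):", saddle_point_residual())
    print("Euclidean action          :", euclidean_action())
    print("(2/3) m omega a^2         :", 2.0 / 3.0)
```

The transit between the wells takes place within a time window of order $1/\omega$, and the action is finite and independent of where the transit is centered, which is why such paths can contribute to the amplitude at all.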



15 P. W. Anderson, B. I. Halperin, and C. M. Varma, Anomalous low-temperature thermal properties of glasses 
and spin glasses, Phil. Mag. 25 (1972), 1-9. 

16 Note that the potential inversion answers a question that arose above, i.e. whether or not the classical solution 
staying at the bottom of the single well was actually the only one to be considered. As with the double well, 
we could have treated the single well within an imaginary time representation, whereupon the well would have 
become a hill. Clearly, the boundary condition requires the particle to start and finish at the top of the hill, i.e. 
the solution that stays there forever. By formulating the semiclassical expansion around that path, we would 
have obtained Eq. (3.31) with $t \to -i\tau$, which, upon analytic continuation, would have led back to the real
time result. 





